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Appl. No. 09/895,435 

Amdt. dated June 8, 2004 

Reply to Office Action of March 8, 2004 



Remarks 

Claims 1-5, 11-13, 15-20 and 53-67 were pending and examined in the Office 
Action of March 8, 2004. Of those claims, 1-5, 11-13, 15-20, 53-55, 57-60, 62-67 stand 
rejected and 56, 61 and 66 are objected to. With this Amendment and Reply, claims 1, 
2, 3, 13, 16, 18, 19 and 20 are amended, and claims 68-75 are newly added, to more 
particularly point out and distinctly claim the invention. 

Applicants note with appreciation the withdrawal of the restriction between 
Groups I and II, and the finding that claims 56, 61 and 66 are only objected to. 

Remarks on the rejections entered in the Office Action of March 8, 2004 follow. 

Rejections under 35 U.S.C. 112, first paragraph 

Claims 1-5, 11-13, 15-20, 54, 55, 57, 59, 60, 64, 65 and 67 stand rejected under 
35 U.S.C. 112, first paragraph as not being described in the specification sufficientiy to 
convey to the skilled artisan that the inventors had possession of the invention (written 
description rejection). It is asserted that the claimed functional variants are not 
sufficientiy described in the specification. Applicant respectfully requests reconsideration 
and withdrawal of these rejections in light of the claim amendments, the enclosed 
Declaration of Bernard Hallet under 37 C.F.R. 1.132, and the following discussion. 

Applicants first note that the claims as amended limit the functional variants 
within the scope of the invention to those that are altered from SEQ ID N0:3 only in the 
central crossover region. These functional variants are sufficientiy described, e.g., at 
paragraph 61 on page 5 of the specification. 

Applicants also assert that functional variants of TRT (SEQ ID N0:3) altered only 
in the central crossover region would be understood to be possessed by the inventors at 
the time of filing, since the literature at that time establishes that variants of a known 
recombination site for a tyrosine recombinase that vary only in the central crossover 
region would be expected to be functional. See enclosed Declaration of Bernard Hallet 
under 37 C.F.R. 1.132. As discussed therein, experimental data with tyrosine 
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recombinases, as well as the knowledge of the mechanism of tyrosine recombinase 
recombination, would have led a skilled artisan to conclude that the inventors had 
possession of such variants at the time of filing. For your convenience, we also enclose 
herewith as Exhibit B, References 2 and 6 cited in Dr. Hallet*s Declaration, i.e.. Van 
Duyne, 2001, Annu. Rev. Biophys. Biomol. Struct. 30:87-104, and Landy, 1989, Annu. 
Rev. Biochem. 58:913-949. 

We also refer the PTO to Paragraphs 10 and 11 of Dr. Hallet's Declaration, as well 
as FIG. 1, attached thereto as Exhibit C, providing experimental results that confirm the 
expected outcome, that TRT variants in the central crossover region are substrates for 
the tyrosine recombinase Tnpl. 

Based on the above discussion and the Declaration of Dr. Hallet, it is clear that the 
skilled artisan would understand that the inventors had possession of the claimed 
invention, since the sequences of the finite number of variants of TRT that vary only in 
the central crossover region were clearly envisioned, and these variants would be 
expected to be functional as Tnpl recombinase substrates. Applicants therefore 
respectfully request reconsideration and withdrawal of the rejections under 35 U.S.C. 
112, first paragraph. 

Rejections under 35 U.S.C. 102rbl 

Claims 1-3, 11-13, 53, 55, 58 and 60 stand rejected under 35 U.S.C. 102(b) as 
being anticipated by Mahillon et al., 1988, NAR 16:11827. The PTO asserts that plasmid 
pGI2, disclosed therein, includes SEQ ID N0:3 and SEQ ID N0:2 but not more than 200 
contiguous nucleotides of SEQ ID NO: 4. Applicants respectfully request reconsideration 
and withdrawal of this rejection, because the cited plasmid does contain more than 200 
contiguous nucleotides of SEQ ID N0:4. Indeed, the entire SEQ ID N0:4 (249 nt) is in 
the pGI2 sequence at nt 498-746. Applicants also provide a printout of GenBank 
Accession XI 3481, which provides a more legible version of the pGI2 sequence from 
Mahillon et al., NAR 16:11827. All 249 nt of SEQ ID N0:4 is underlined therein. Since 



277341.1 



Page 8 of 10 



Appl. No. 09/895,435 

Amdt. dated June 8, 2004 

Reply to Office Action of March 8, 2004 

pGI2 in Mahillon et al. does comprise the entire sequence of SEQ ID N0:4, that reference 
does not anticipate any of the claims. Accordingly, withdrawal of this rejection is 
respectfully requested. 

Claims 1-3, 11-13, 53, 55, 58 and 60 also stand rejected under 35 U.S.C. 102(b) 
as being anticipated by Mahillon et al., 1988, EMBO J. 7:1515-1526. It is asserted that 
plasmids are described therein that comprise SEQ ID NO: 3 and SEQ ID NO: 2 but not 
more than 100 contiguous nucleotides of SEQ ID N0:4. Applicants respectfully request 
reconsideration and withdrawal of this rejection, because the only sequence of a plasmid 
provided in the cited reference is Tn4430, which comprises the entire SEQ ID NO: 4, at 
nucleotides 1-249. For your convenience, the applicants also provide GenBank entry 
X07651, providing the Tn4430 sequence. The entire SEQ ID N0:4 sequence is 
underlined therein. There is also no indication that any of the other plasmids described 
comprise SEQ ID N0:3, but not the entire sequence of SEQ ID N0:4. Accordingly, 
withdrawal of this rejection is respectfully requested. 

Claims 20, 63 and 65 stand rejected under 35 U.S.C. 102(b), as anticipated by, or 
103(a), as obvious over Mahillon et al., 1988, EMBO J. 7:1515-1526. It is asserted that 
the claimed kits are anticipated or obvious because E coli cells described therein that 
comprise the SEQ ID NO: 2 or SEQ ID NO: 3 containing vectors would be expected to also 
comprise both the Tnpl protein. Applicants respectfully request reconsideration and 
withdrawal of these rejections based on the claim amendments and the following 
discussion. 

The rejected claims as amended all require that the DNA molecule comprising one 
or more copies of TRT (SEQ ID N0:3), or functional variant, be in a separate container 
from the Tnpl protein. This does not occur, or would not be expected to occur with any 
composition described in the cited reference because Mahillion et al., or any other 
reference published before the instant disclosure, does not suggest any function for the 
TRT sequence. Therefore, there would be no reason to have the isolated TRT sequence 
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or functional variant in the same kit, but in separate containers, as claimed. Accordingly, 
withdrawal of this rejection is respectfully requested. 



Based on the claim amendments and the above discussion, applicants respectfully 
request withdrawal of all rejections and passage of the claims to allowance. If there are 
any minor matters preventing this result, applicants request that Examiner McGarry 
contact the undersigned attorney. 

Applicants believe that no fee is required with this filing. However, if there are 
any unexpected fees required to maintain pendency of this application, the PTO is 
authorized to withdraw those fees from Deposit Account 01-1785. 



Conclusion 



Respectfully submitted 



AMSTER, ROTHSTEIN & EBENSTEIN LLP 
Attorneys for Applicant 
90 Park Avenue 
New York, NY 10016 
(212) 336-8000 



Dated: New York, New York 
June 8, 2004 




Elie H. Gendloff 
Registration No.: 44,704 
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X13481. Bacillus thuringi...[gi:31717323 Links 



LOCUS BTPGI2XX 9672 bp DNA linear BCT 07-JUL-2002 

DEFINITION Bacillus thuringiensis plasmid pGI2 with transposon Tn4430. 
ACCESSION X13481 
VERSION X13481.1 01:3171732 

KEYWORDS plasmid; plasmid pGI2; recombinase; resolvase; transposase; 

transposon; unidentified reading frame. 
SOURCE Bacillus thuringiensis 
ORGANISM Bacillus thuringiensis 

Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus 
cereus group. 
REFERENCE 1 (bases 1 to 6999) 
AUTHORS Mahillon,J. and Seurinck,J. 

TITLE Complete nucleotide sequence of pGI2, a Bacillus thuringiensis 

plasmid containing Tn4430 
JOURNAL Nucleic Acids Res. 16 (24), 11827-11828 (1988) 
MEDLINE 89098342 
PUBMED 3211758 
REFERENCE 2 
AUTHORS Mahillon,J. 
TITLE Direct Submission 

JOURNAL Submitted (04-NOV-1988) Mahillon J., Plant Genetics Systems, J 

Plateaustraat 22, B-9000 Gent, Belgium 
REMARK revised by [3] 
REFERENCE 3 (bases 1 to 9672) 
AUTHORS Hoflack,L. 
TITLE Direct Submission 

JOURNAL Submitted (24-MAR-1998) Hoflack L., Plant Genetics Systems, J 
Plateaustraat 22, B-9000 Gent, Belgium 
COMMENT On Jun 2, 1998 this sequence version replaced gi:40316. 
FEATURES Location/Qualifiers 
source 1..9672 

/organism = "Bacillus thuringiensis" 
/mol_type= "genomic DNA" 
/strain = "thuringiensis Hl.l." 
/db_xref = "taxon: 1428" 
/plasmid="pGI2" 
repeat region 493..4651 

/transposon="Tn4430" 
repeat_region 493. .497 

/note = "direct repeat 1" 
repeat_unit 498..535 

/note = "inverted repeat A" 
CDS 744..1598 

/codon_start=l 



/transl_table=ll 
/product ="TnpI resolvase" 
/protein_id="CAA31832.1" 
/db_xref="GI:40317" 
/db_xref="GOA:P10020" 
/db_xref = "Swiss-Prot:P10020" 

/translation = "MDVAKQFSSYLKQENKTENTVQGYTSGIRQYIKWFEGSYDRKLT 

KLYRQNILEYISYLKNVmUSfAKSINHKISSlAKFNEFLIQKGSQQDQVILKTDMIKV 

QTVYASPTQIVEUDVKKFLQSVLEDNNKRNYAIATLLAYTGVRISEALSIKMNDFNLQ 

TGECIIRSGKGGKQRIVLLNSKVLSAIKDYLIDRKTYSTAHESPYLFISKKREKLDRT 

WNRIFKSYSNVITPHQLRHFFCTNAIEKGFSIHEVANQAGHSNIHTTLLYTNPNQLQ 
LKNKMELL" 
CDS 1617..4580 
/codon_start= 1 
/transl_table=ll 
/product ="TnpA transposase" 
/proteinid = "CAA3 1833.1" 
/db_xref="GI:40318" 
/db_xref="GOA:P10021" 
/db_xref="Swiss-Prot:P10021" 

/translation="MGVKQLLSEAQRNELMDLSRLTEWDLVTFRTFSKHDLHLILKHR 

rgynrlgfAlrlvlirypgwslteykdipqywayvtsrlrippeeflvyakrgntlw 
ehlgeirteygyqnfsseyketllqflvqqamdnnntlylieitistlrktkvilpam 

YVIEDIVWEAKQQADQKVYSILHDGLVQEQKDQLDALLLPTINGKSPLAWLKDVPAQP 

SPESFLKVIDRLQFVQKIGLTIDTTKINTNRLRQLARLGSKYEPYAFRRFNEVKRYSM 

LVSFLLEITQDLIDYAIEIHDRLMMNLQTKGKKEQDEIQQANGKKLNEKILQFITVCG 

TLIEAKETGKDAFAALDEVMSWNEMVESVEEAKQLSRPLNYDYLDLLNTRYSYVRRYA 

PTLLRSLHFRATKSGEPVLQALDTIHELNETGKRKVPHGAPLHFVSNRWQKHVYDDDG 

NINRHYYELAALTELRNHIRSGDIFVSGSRHHKAFDDYLIPYDEWNEVSNIPNGLTAP 



LKAEDYITDRINRLNEHLEWLSKNSEKLEGVDISQGKLHVERLDRGTPEEAKAFSKLL 



HSMLPRIKLTDLLIEVASWTGFHDQFIHASTNQSPDQEEQNIVLATLMAMGTNIGLTK 

]\MATPGISYRQMANASQWRMYDDAMVRAQSILVNFQKEQKLSSYWGDGTTSSSDGMR 

LSIAVRSLHADSNPHYGTGKGGTIYRFVSDQLSAYHVKVITTNARDALHVLDGLLHHE 

TDLKIEEHYTDTAGYTDQVFALTHLLGFRFAPRIRDLADTKLFSIPGGEEYE^A^QALL 

KGKINVKLIKENYEDIRRLAYSVQTGKVSSALIMGKLGSYARQNKLATALGEMGRIEK 

TLFTLDYISNKAVRRRVQKGLNKGEAINALARTIFFGQRGEFRERALQDQLQRASALN 

IIINAISVWNTVYMEKAVEELKARGEFREDLMPYAWPLGWEHINFLGEYKFEGLHDTG 

QMNLRPLRIKEPFYS" 
repeat_unit 4609.. 4646 

/note = "inverted repeat A'" 
repeat_region 4647.. 4651 

/note = "direct repeat 1" 
CDS 4859.-5473 

/note = "unnamed protein product; ORP 1" 

/codon_start=l 

/transl_table=ll 

/protein_id="CAA31836.1" 

/db_xref="GI:3171733" 

/db_xref="GOA:P10022" 

/db_xref = "Swiss-Prot: PI 0022" 

/translation="MTNTIDFKHVEKNARIRDFENEKEKFKQDHNGINQEEVNQAMQV 

LSKATGGKEIFIGTKRSPQSKVKFAQFIQDNWDYALENAFFTDEEMLFLLRIQRFLQF 

KSNCIVNDIHSRNALPMSQKQIADRLKTDKSKISRIVNSLVQKGVIVKANGHKPEGVK 
ARTYALFINPNIIYSGERDNVETTLKALFMNSKSLFKKFPIALF" 
CDS 6037..6849 

/note = "unnamed protein product; ORF 2" 

/codon_start=l 

/transl_table=ll 

/protein_id= "CAA3 1837.1" 

/db_xref="GI:3171734" 

/db_xref="GOA:P10023" 

/db_xref="Swiss-Prot:P10023" 

/translation="MKKLLISFIAILFFICGFNLKAFAAEEIIDYQSLYNQAIQEGVL 
DQNSVSYNEWLKQNKEEFMPIYQDGLKQGVFLEPLSYNEWLKLNNYGQAPTGDIELFD 



DVTPRGSWGGFTLKAGDIFITNATSSAGIVGHAAIANGDNYILHMPGAGQGNQQLSTS 

NWMQKYTASGKWIKVYRLKDQTLARDVARYADRNFYSTTGSATKNVYLDYGIDTHLYQ 
KNPTYCSKLWQALYFGSGSRNIWQAVSGIVTPYGLIDTFTSAYRPSLVKTY'' 
CDS 6896..7240 

/note = "unnamed protein product; ORF 3" 

/codon_start=l 

/transl_table=ll 

/protein_id="CAA31834.1" 

/db_xref="GI:40321" 

/db_xref="GOA:P10024" 

/dbxref = "S wiss-Prot: PI 0024" 

/translation="M]VLAIFISIVLWSFVLFNRVQAKKSIYNYIARQGIQESQLKYID 

FHKDFmGGYWlAVYVEGENPDIYYEYSYQDKKVNFQAYFNSEKAIKKKMWGGSGLTE 
lEMKKLKYPPLQ" 
CDS 7829..9166 
/codon_start=l 
/transl_table=ll 

/product="put. GI2 site-specific recombinase" 

/protein_id="CAA31835.1" 

/db_xref="GI:40322" 

/db_xref="GOA:P10025" 

/db_xref="Swiss-Prot:P10025" 

/translation="MNKFAIIHMQKFQISDVQGIQKHNQRQGKSKSNLDIDYSKSEQN 

YDLLNQQKIRYESTIKQEISERVKRKPRANSWLSEFWTASPDYMHSLSLEEQKRYF 

ESSLDFIQKRYGKQNTLYAMVHMDEATPHMHIGVMPITEDNRLSAKDMFTRKELISLQ 

QDFPLEMREKGFDVDRGEGSEKKHLSPQAFKEKQDLEVEVEQLSNTVKTHLKTKVVETH 

NQLQQTTNYIEKQNETLQKIQQQFLSLDKKIKEKKQEFETFRNQIPDKPVSMSYLREE 

TKTEVTTKLFGKPEITEKKTGNIVVTREQWRDMTEKVNAAVIVKKDYERLQKTDLVKE 

NQSLREDNKYLEETIKGNNLALKHSYKQNRELEEVNKELHTEIGTLKAHIRDLQMNIK 

VLYQQTKKVFKEQFKAFRGLIKNELDMKGVDNQFEREHTREIRSRQKGYDMER" 
ORIGIN 

1 caagactatg ttgtaaatgc tcatagcatt gtatgttgga tttcaattgt aagttccaaa 
61 gtaaactttt gaatatttgt ttactttcaa ataattggtt ttatggtgtt ttcagaggat 
121 tttagaaaaa tagagattcc cgtaaagaaa gtatacattt agaatgcgaa ttttgtacgg 
181 atgatggtta aagatactca ataaaaattt cattttgtag aaaatcatca aaataaaacg 



241 agcacttatg gcaaggcgct cgtttaaaaa acattctgtg tattagggtt gtgtttataa 
301 gttatgaacc gatgaatatt tgttaaagta aggtttaaga tagtttattc attttaaagt 
361 aaatgggaac ctgtacggca aaaaaagagg accttcgaac agtgctcttt ctctgagtct 
421 gtagtgcatg atactttatg aagttttttt agaaataaaa tgaatgagct cacgactaaa 
481 acgaacttgc tatacaagg g" gtaccgccag catttcggaa aaaaaccacg ctaagaaaat 
541 cagagttaaa aaatcagaaa atatatcatt attccaagac acatacagtg tctt ttttta 
601 tacaa aaaat aatacaacac aatattaatt gtgttgtatt aggtgttata ataaatataa 
661 atcta^gggt ttaacgcaac acaatttatc gataaataaa tacttttaga c gcaacacaa 
721 ttratagarg cg gaggaa at cacat^ atg ttgcaaaaca gttttcttct tatcttaaac 
781 aagagaataa aaccgagaac actgttcagg gatacacatc aggtattaga cagtacataa 
841 aatggtttga aggttcctat gacagaaaat taacaaaatt gtaccgacaa aatatcttag 
901 agtacattag ttatttaaag aatgtcaaaa tgttgaacgc caagtccatt aaccacaaga 
961 ttagtagcct tgctaaattt aatgaatttc taatacagaa aggaagtcaa caagatcaag 
1021 taattttaaa aacagacatg ataaaggttc aaactgtcta tgcttctcca acccaaattg 
1081 ttgaattaga tgtaaaaaag tttttacaaa gtgtgttaga ggataataac aaacgtaatt 
1141 atgcaattgc cactctccta gcatatacag gagtacgtat ttcagaggca ttatctatca 
1201 aaatgaatga cttcaattta cagactgggg aatgtattat tcgaagtgga aaaggaggta 
1261 aacaacgaat tgtattacta aatagtaagg tacttagtgc tatcaaagat tatctcatcg 
1321 atcgaaaaac atacagtaca gcacatgaat ctccgtatct ttttattagt aaaaagcgag 
1381 aaaagctcga ccgtacggtc gtcaatcgta tctttaaatc atacagcaat gttattactc 
1441 cacaccaatt acgacacttc ttctgtacga atgcaattga aaaaggattt agcattcatg 
1501 aagttgcaaa tcaagctggg cactctaaca tccatacgac actactttac acaaatccaa 
1561 accaactgca gctaaaaaat aaaatggagc tcttataaag gagtaaaggg gaaacaatgg 
1621 gagttaaaca attactgtca gaggcacaac gaaatgagtt aatggattta tcacgtttaa 
1681 cagaatggga tttggtaaca tttcgtactt tttctaaaca tgacttacat ttaatattaa 
1741 aacaccgtag aggatacaat cgtttagggt tcgctcttcg gcttgtatta attcgttatc 
1801 ctggctggtc cttaacggaa tataaggaca ttccacaata tgtagtggct tacgttacga 
1861 gtcggttacg gattccacct gaagaatttt tggtatatgc caaacgtggg aatactttat 
1921 gggagcatct tggggaaatt cgaacagaat atggatatca gaatttttct tctgaatata 
1981 aagaaacttt attacaattt ttagtgcaac aagcaatgga taacaataat accctctatt 
2041 taatcgaaat aacaatatct acattgcgaa aaacgaaagt cattcttcca gctatgtatg 
2101 tgattgaaga tatcgtctgg gaagcaaagc aacaagcaga tcagaaagtg tatagtatac 
2161 tacatgacgg tttggtccaa gaacaaaaag atcaattaga tgcgttactt ttaccaacta 
2221 taaatggtaa gtctccatta gcatggctaa aagatgtacc ggcccagcct tcaccagaat 
2281 catttttaaa agtaatagac cgattacaat ttgttcaaaa aattggtctt actattgata 
2341 ccacaaaaat taatacaaat cgtcttcggc agctcgctag attaggatca aaatatgagc 
2401 catatgcttt tagacgattc aatgaagtaa aaagatatag tatgttagtc tcatttttat 
2461 tggagattac acaagacctt atagattacg caattgaaat tcatgatcgt cttatgatga 
2521 atcttcaaac aaaagggaaa aaagaacaag acgaaataca gcaagcaaat gggaaaaaat 
2581 taaacgagaa gattttacaa tttattactg tatgcgggac actaattgag gcaaaagaaa 
2641 cgggtaaaga tgcttttgca gccctagatg aagtaatgtc atggaatgag atggtcgaat 
2701 ctgttgaaga agcaaaacaa ctatcacggc ctttaaatta tgactatttg gatttgttaa 
2761 atactcgtta ttcctatgtt agacgatatg cacctacttt attacgcagt cttcatttta 
2821 gagcgacgaa atctggagaa cccgttttac aagcacttga tacgattcat gaattgaatg 
2881 aaacaggtaa acgaaaagtg cctcatggcg ctcccttaca ctttgtatct aaccgttggc 
2941 aaaaacatgt atatgatgat gacggaaaca ttaatcggca ctattatgaa ttggcagctt 



3001 taaccgagtt aagaaatcat attcgttcag gagatatttt tgtatcaggc agtagacatc 
3061 ataaggcatt tgatgattat cttataccgt atgatgaatg gaacgaagtt tcaaacattc 
3121 caaatggcct tacagcccct ttaaaagctg aagattatat aactgaccga ataaaccgat 
3181 taaatgaaca tttagaatgg ttatcaaaaa atagcgagaa attagaagga gtagatatta 
3241 gtcaaggaaa actacatgtt gagcgattgg atagaggaac tccagaagaa gcaaaagcat 
3301 ttagtaaact tcttcatagt atgttaccca gaataaaact gacagattta ctaatagaag 
3361 tggccagttg gactggattt catgatcaat ttatccacgc ctctaccaat caatctcctg 
3421 atcaagagga acagaatatt gtattagcta ctttgatggc aatggggaca aatataggtc 
3481 ttacaaaaat ggcggaagct acccctggaa tctcctatag acaaatggca aatgcttcac 
3541 aatggagaat gtatgatgat gcaatggttc gcgctcaatc tattctcgtg aatttccaaa 
3601 aggaacaaaa gttatcatcc tattggggag atgggacaac ttcttcttct gatggtatgc 
3661 gtttatccat tgctgtacgt tctctacatg cagattccaa tccacactat ggaacgggaa 
3721 aaggtggaac catttatcga tttgtaagtg atcagctatc tgcctatcat gtgaaagtca 
3781 ttactaccaa tgcacgagat gctcttcatg tattagatgg gcttcttcat catgaaacag 
3841 atttaaaaat tgaagaacat tacacagata cagctggata tacagatcaa gttttcgctt 
3901 taacacactt gttgggattt cgatttgctc ctcgtattcg tgatttggca gatactaagc 
3961 ttttttctat acctggtgga gaagaatatg aaaatgttca agcactttta aaaggaaaaa 
4021 ttaatgtaaa gttaataaaa gaaaactatg aagatattag aaggttggca tactcggttc 
4081 aaacaggaaa agtatctagc gctcttatca tggggaagct cggatcatat gcaagacaaa 
4141 ataaactcgc aacagcactt ggagaaatgg gacgaatcga gaaaaccctc tttacgctgg 
4201 attacatatc taataaagcc gtaaggagac gggttcaaaa aggtttaaat aaaggagaag 
4261 caatcaatgc attagctaga actatatttt ttggacaacg tggagaattt agagaacgtg 
4321 ctctccaaga ccagttacaa agagctagtg cactaaacat aattattaac gctataagtg 
4381 tgtggaacac tgtatatatg gaaaaagccg tagaagaatt aaaagcaaga ggagaattta 
4441 gagaagattt aatgccatat gcgtggccgt taggatggga acatatcaat tttcttggag 
4501 aatacaaatt tgaaggatta catgacactg ggcaaatgaa tttacgtcct ttacgtataa 
4561 aagagccgtt ttattcttaa tataacggct ctttttatag aaaaaatcct tagcgtggtt 
4621 tttttccgaa atgctggcgg tacccctaca ataacatata aaaagtcaga caagatacaa 
4681 caaaaaaagc cattcatgct ataattgagt tgtcgaatct acaactatag gagaagggct 
4741 ttttttactg ttatacaatt aattagagtg ttcagactca aattaattta tctaacttca 
4801 taataacaaa ataaaagtct cttcgctagt tgtaaatact agaaacgagg agaaaagcat 
4861 gacaaatacc attgatttta aacatgttga gaaaaatgca agaatacgtg attttgagaa 
4921 tgaaaaagaa aagtttaaac aagatcataa cggaattaat caagaagaag taaatcaagc 
4981 aatgcaagta ttaagtaaag ctactggcgg aaaagaaata tttataggta ctaaacgttc 
5041 cccgcaatca aaagtaaagt ttgcccaatt tattcaagat aattgggatt atgctttaga 
5101 aaatgcattt tttactgacg aagaaatgct ctttttactt cgtattcaaa ggtttttaca 
5161 atttaaaagt aattgtattg taaatgatat tcattctaga aatgcacttc ctatgagtca 
5221 aaaacaaata gctgatagat taaaaactga taaatctaaa attagtcgca ttgtaaatag 
5281 tctggtacaa aaaggtgtta ttgtaaaagc aaatggacat aaaccagaag gggttaaagc 
5341 tcgtacatac gcattattta ttaatcctaa tattatttat agcggtgaac gtgataatgt 
5401 tgaaacaact ttaaaagctt tatttatgaa ttcgaaatca ctatttaaaa aatttccaat 
5461 agcccttttt taagggctat ttttttaatt ttaatagttg tctaatttga caactaaagt 
5521 tgtctaatct gacaactatt aaaaccacta taaacccttg atataataag ctttttaact 
5581 agtttagtag gtattttact ctcactctct atataagggc taaaagttcc gcaaggaaca 
5641 ggtcgtattt ttgaggaaaa agcactcaaa aatctccacc tttttcacta cgtttcatcc 
5701 ttgccaaact tttttcggct ggtttttagc tttcgccaag ccgaacagac caaaaacggg 
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gttaagttgc tcgacttggg ctagcaattg taccagccga accgccaatt cttccgttca 
aagcatgatt ctttttccat atgtatcaac gctatgtggg gtttagcccc cacacgacta 
gccagcactc acacccaaaa acaagggaga attacattat ttagctaata aatatttgta 
tgttttttgt cacaagtatt tacattttga atttgaaaat cgtgttacct ttataaaagg 
ttaccgaaat ataataatta tttagggaga caatttatga aaaaattatt aattagtttt 
attgctattt tattttttat ttgtggtttt aacttgaaag cgtttgctgc tgaggaaatt 
attgattatc aatctcttta taatcaagca attcaagaag gtgttttaga tcaaaatagt 
gtatcttata atgaatggtt aaaacaaaat aaagaagaat ttatgcctat atatcaagat 
ggtttaaaac aaggtgtttt tctagaaccg ttatcttata atgaatggtt aaaattaaat 
aattatggac aagcaccaac gggagatatt gaactttttg atgacgtgac tccaagagga 
agttggggtg gatttacttt aaaagctgga gatatattta taacaaacgc aactagttca 
gcaggtatag taggacatgc agctattgct aatggtgata attatatttt acatatgcct 
ggtgctggcc aaggaaatca acaattatct acatctaatt ggatgcaaaa atatactgcg 
tcaggaaaat ggattaaggt ttatcgatta aaagatcaaa cacttgctag agatgtagct 
agatatgctg atagaaattt ttattccact actggaagtg caactaaaaa tgtttattta 
gattatggta tcgatacaca tctgtatcaa aaaaatccaa cttattgttc taaattagta 
tttcaagcac tctattttgg ttctggttca agaaatgtta tgcaagcggt tagtggaatc 
gtaactcctt atggtttaat tgatactttt acttcagcat atagaccgtc attagtaaaa 
acatattaat aaataattta ggaggattga atttgaaaaa aaaaagatgt taataatgat 
ggctatcttt atatcaatag ttttagtggt tagttttgtt ttatttaata gagtacaggc 
gaaaaaatct atatataatt acattgctag acaaggaata caagaaagtc agttgaaata 
tattgatttt cataaagact ttaaaatggg cggatattgg cttgctgttt atgtggaagg 
agaaaatcct gatatttatt atgaatattc ttatcaagat aaaaaagtta attttcaagc 
ttactttaat tctgaaaaag cgataaagaa aaaaatgtgg ggtggaagtg gacttactga 
aatagagatg aaaaaattaa agtatccacc tttgcaataa taaacagtat aaaaaagaac 
ttaaatacag gataaaacaa agactaatta gtactaaata gataagtact aattagtctt 
tttatttttt aattaattta gaattaagag tttgataaag gatttaatga atgattgaaa 
taactcgcaa aacgactatt gggagaagaa aaataagcgg taagtgtcaa tggtaagcgg 
actggaatgg ttcagcggtt tatgctgatc catgaaagga gcattaagga atggacacgt 
gcaaccgatt ttccatcctc ttggtagttc ctcaaaacga accgttttgg ggacgccaac 
cggcgaggaa gcccctcaag gattgagggg ttggggagtt ggatcgatgg ggtcaagggg 
aagagttccc cttgtggggt gtggggcaaa gcccctggtc tttatcacgc tgtggcgtga 
tttggcgaag ccaacgaggt ctcgcagagc ccacaccaaa tgtaatttgg tataatgggc 
tataccaaaa ctttggattc gctctctcga tatgttatga taagctcaaa agtagtcctc 
ttacggaaga caattgtgag ggaaaaacat gaacaaattt gcaatcattc acatgcagaa 
atttcaaatt tctgatgtgc aagggattca aaaacataat caaagacaag gaaaaagtaa 
atcaaattta gacatcgatt attcgaaaag tgaacagaat tatgatttgt tgaatcaaca 
aaaaattaga tatgaaagca caattaaaca agagatcagt gagcgagtaa aacgaaagcc 
acgggcaaat tctgttgtgc tttcagaatt tgtagtaaca gcttcgcctg attacatgca 
ttcattaagt ttagaagaac agaaacgtta ttttgagagt tcactagatt ttattcaaaa 
gcgttatggc aaacaaaata cgttgtatgc gatggttcat atggatgaag cgacaccaca 
tatgcatatt ggtgtgatgc caattacaga agacaatcgt ttgtccgcaa aagatatgtt 
tacgagaaaa gaattgatat ctttgcaaca ggacttcccg ttggaaatgc gggaaaaagg 
ttttgatgta gatcggggag aaggttctga gaaaaaacat ttatcgcctc aagcctttaa 
ggaaaaacaa gatttagaag tagaagtgga acagttatcc aatgtaaaaa cgcatttaaa 
gacaaaagta gtggaaacgc ataatcaatt acaacaaacc acaaactaca tcgaaaaaca 



8521 gaatgaaacg ttacaaaaaa ttcagcaaca atttttgagt ttagataaaa aaataaaaga 
8581 aaagaaacag gaatttgaaa cgtttcgaaa tcaaattccg gataagccag tgtcaatgtc 
8641 ttatttgcga gaagaaacaa aaacagaagt aacaacgaaa ctgtttggaa agccagaaat 
8701 tactgagaag aaaacaggaa acattgtggt tacacgtgag caatggagag atatgacaga 
8761 aaaagtgaat gcagccgtta ttgttaaaaa agattatgaa cgtttacaga agacggatct 
8821 tgtaaaagag aatcaaagtt tacgagaaga taataaatat ttagaagaaa caataaaggg 
8881 gaataatctt gctttaaaac attcttataa acaaaatcgg gaattagaag aagtaaataa 
8941 agaattgcat actgaaatag gcactttaaa ggctcatata agagatttac agatgaatat 
9001 aaaggtttta tatcaacaga cgaaaaaagt ctttaaagag caatttaagg catttagagg 
9061 acttattaaa aacgaactgg atatgaaggg tgtagataat caatttgaac gtgaacatac 
9121 aagagaaata agaagtagac agaaaggata tgacatggaa cgttagaaac gttttgtgat 
9181 atactataat tattcatcac taactaattt atgggtttca tccatttaaa aatctcttta 
9241 tatgtgtgca tataaagaga ttttttgcgt ttatgtaatt aaaattatgt taagattgat 
9301 aagactacat ttgaaaatgt gggttcaagt tgctgtttga ctgaattgca caaaaaacca 
9361 attctattac acgagaattg gtttttttgc gtataacgaa tagagatata tctaggcgtt 
9421 taaggagttg tatcttatat aatttagtct aacattaata aaaagaaaat tcaaaaatat 
9481 tcactaggtt tgttgtttat aaggaattat aaaacatgaa attttgcata ttttctaatt 
9541 aattcccgga aataggaatt aatgatttat aataaataga tgaaataccc tttcagtttc 
9601 cgatgaactg tactttagtc tgttgttagt acgaagtctt ttaaaagact tcgtactttt 
9661 tttattgtcg ac 



LOCUS BTTN4430 4149 bp DNA linear BCT 12-SEP-1993 

DEFINITION Bacillus thuringiensis transposon Tn4430. 
ACCESSION X07651 
VERSION X07651.1 GI:40347 

KEYWORDS plasmid; resolvase; tiipA gene; tnpl gene; transposase; transposon. 
SOURCE Bacillus thuringiensis 
ORGANISM Bacillus thuringiensis 

Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; Bacillus 
cereus group. 
REFERENCE 1 (bases 1 to 4149) 
AUTHORS Mahillon,J. and Lereclus,D. 

TITLE Structural and functional analysis of Tn4430: identification of an 
integrase-like protein involved in the co-integrate-resolution 
process 

JOURNAL EMBO J. 7 (5), 1515-1526 (1988) 
MEDLINE 88312602 
PUBMED 2842151 
COMMENT Data kindly reviewed (03-APR-1989) by Lereclus D. 
FEATURES Location/Qualifiers 
source 1..4149 

/organism = "Bacillus thuringiensis" 
/mol_type= "genomic DNA" 
/strain="Hl.l" 
/db_xref="taxon:1428" 
/map = "plasmid pGI2" 
/clone="pGI200" 
/clone_lib="pBR322" 
repeatunit 1..38 

/note = "inverted repeat A" 
CDS 247..1101 

/note = "unnamed protein product; put. resolvase Tnp I (AA 1 
- 284)" 

/codon_start= 1 
/transl_table= 1 1 
/protein_id="CAA30491 .1" 
/db_xref="GI:40348" 
/db_xref="GOA:P10020" 
/db_xref="Swiss-Prot:P10020" 

/translation = "MDVAKQFSSYLKQENKTENTVQGYTSGIRQYIKWFEGSYDRKLT 

KLYRQNILEYISYLKNVKMLNAKSINHKISSLAKFNEFLIQKGSQQDQVILKTDMIKV 

QTWASPTQIVELDVKKFLQSVLEDNNKRNYALATLLAYTGVRISEALSIKMNDFNLQ 

TGECIIRSGKGGKQRIVLLNSKVLSAIKDYLIDRKTYSTAHESPYLFISKKREKLDRT 



WNRIFKSYSNVITPHQLRHFFCTNAIEKGFSIHEVANQAGHSNIHITLLYTNPNQLQ 
LKNKMELL" 
CDS 1120..4083 

/note = "unnamed protein product; transposase Tnp A (AA 1 - 
987)" 

/codon_start= 1 

/transl_table=ll 

/protein_id="CAA30492.1" 

/db_xref="GI:40349" 

/db_xref="GOA:P10021" 

/db_xref="Swiss-Prot:P10021" 

/translation="MGVKQLLSEAQRNELMDLSRLTEWDLVTFRTFSKHDLHLILKHR 

RGYNRLGFALRLVURYPGWSLTEYKDIPQYWAYVTSRLRIPPEEFLVYAKRGNTLW 

EHLGEIRTEYGYQNFSSEYKETLLQFLVQQAMDNNNTLYLIEITISTLRKTKVILPAM 

YVIEDIVWEAKQQADQKVYSILHDGLVQEQKDQLDALLLPTINGKSPLAWLKDVPAQP 

SPESFLKVIDRLQFVQKIGLTIDTTKINTNRLRQLARLGSKYEPYAFRRFNEVKRYSM 

LVSFLLEITQDLIDYAIEIHDRLMMNLQTKGKKEQDEIQQANGKKLNEKILQFITVCG 

TLIEAKETGKDAFAALDEVMSWNEMVESVEEAKQLSRPLNYDYLDLLNTRYSYVRRYA 

PTLLRSLHFRATKSGEPVLQALDTIHELNETGKRKVPHGAPLHFVSNRWQKHVYDDDG 

NINRHYYELAALTELRNHIRSGDIFVSGSRHHKAFDDYLIPYDEWNEVSNIPNGLTAP 

LKAEDYITDRINRLNEHLEWLSKNSEKLEGVDISQGKLHVERLDRGTPEEAKAFSKLL 

HSMLPRIKLTDLLIEVASWTGFHDQFIHASTNQSPDQEEQNIVLATLMAMGTNIGLTK 

MAEATPGISYRQMANASQWRMYDDAMVRAQSILVNFQKEQKLSSYWGDGTTSSSDGMR 

LSIAWSLHADSNPHYGTGKGGTIYRFVSDQLSAYHVKVITTNARDAUWLDGLLHHE 

TDLKIEEHYTDTAGYTDQVFALTHLLGFRFAPRIRDLADTKLFSIPGGEEYENVQALL 

KGKINVKLIKENYEDIRRLAYSVQTGKVSSALIMGKLGSYARQNKLATALGEMGRIEK 

TLFTLDYISNKAVRRRVQKGLNKGEAINALARTIFFGQRGEFRERALQDQLQRASALN 



IIINAISVWNTVYMEKAVEELKARGEFREDLMPYAWPLGWEHINFLGEYKFEGLHDTG 



QMNLRPLRIKEPFYS" 
repeat^unit 4112..4149 

/note = "inverted repeat A'" 

ORIGIN 

l(^ ^ggtaccgc cagcatttcg gaaaaaaacc acgctaagaa aatcagagtt aaaaaatcag_ 
61 aaaatatatc attattccaa gacacataca gtgtcttttt ttatacaaaa aataatacaa 
121 cacaatatta'attgtgttgi attaggtgtt ataataaata taaatctagg ggtttaacgc 
181 aacacaattt atcgataaat aaatactttt agacgcaa cacaatttatag acgcggagga 
241 aatcacat^ atgttgcaaa acagttttct tcttatctta aacaagagaa taaaaccgag 
301 aacactgttc agggatacac atcaggtatt agacagtaca taaaatggtt tgaaggttcc 
361 tatgacagaa aattaacaaa attgtaccga caaaatatct tagagtacat tagttattta 
421 aagaatgtca aaatgttgaa cgccaagtcc attaaccaca agattagtag ccttgctaaa 
481 tttaatgaat ttctaataca gaaaggaagt caacaagatc aagtaatttt aaaaacagac 
541 atgataaagg ttcaaactgt ctatgcttct ccaacccaaa ttgttgaatt agatgtaaaa 
601 aagtttttac aaagtgtgtt agaggataat aacaaacgta attatgcaat tgccactctc 
661 ctagcatata caggagtacg tatttcagag gcattatcta tcaaaatgaa tgacttcaat 
721 ttacagactg gggaatgtat tattcgaagt ggaaaaggag gtaaacaacg aattgtatta 
781 ctaaatagta aggtacttag tgctatcaaa gattatctca tcgatcgaaa aacatacagt 
841 acagcacatg aatctccgta tctttttatt agtaaaaagc gagaaaagct cgaccgtacg 
901 gtcgtcaatc gtatctttaa atcatacagc aatgttatta ctccacacca attacgacac 
961 ttcttctgta cgaatgcaat tgaaaaagga tttagcattc atgaagttgc aaatcaagct 
1021 gggcactcta acatccatac gacactactt tacacaaatc caaaccaact gcagctaaaa 
1081 aataaaatgg agctcttata aaggagtaaa ggggaaacaa tgggagttaa acaattactg 
1141 tcagaggcac aacgaaatga gttaatggat ttatcacgtt taacagaatg ggatttggta 
1201 acatttcgta ctttttctaa acatgactta catttaatat taaaacaccg tagaggatac 
1261 aatcgtttag ggttcgctct tcggcttgta ttaattcgtt atcctggctg gtccttaacg 
1321 gaatataagg acattccaca atatgtagtg gcttacgtta cgagtcggtt acggattcca 
1381 cctgaagaat ttttggtata tgccaaacgt gggaatactt tatgggagca tcttggggaa 
1441 attcgaacag aatatggata tcagaatttt tcttctgaat ataaagaaac tttattacaa 
1501 tttttagtgc aacaagcaat ggataacaat aataccctct atttaatcga aataacaata 
1561 tctacattgc gaaaaacgaa agtcattctt ccagctatgt atgtgattga agatatcgtc 
1621 tgggaagcaa agcaacaagc agatcagaaa gtgtatagta tactacatga cggtttggtc 
1681 caagaacaaa aagatcaatt agatgcgtta cttttaccaa ctataaatgg taagtctcca 
1741 ttagcatggc taaaagatgt accggcccag ccttcaccag aatcattttt aaaagtaata 
1801 gaccgattac aatttgttca aaaaattggt cttactattg ataccacaaa aattaataca 
1861 aatcgtcttc ggcagctcgc tagattagga tcaaaatatg agccatatgc ttttagacga 
1921 ttcaatgaag taaaaagata tagtatgtta gtctcatttt tattggagat tacacaagac 
1981 cttatagatt acgcaattga aattcatgat cgtcttatga tgaatcttca aacaaaaggg 
2041 aaaaaagaac aagacgaaat acagcaagca aatgggaaaa aattaaacga gaagatttta 
2101 caatttatta ctgtatgcgg gacactaatt gaggcaaaag aaacgggtaa agatgctttt 
2161 gcagccctag atgaagtaat gtcatggaat gagatggtcg aatctgttga agaagcaaaa 
2221 caactatcac ggcctttaaa ttatgactat ttggatttgt taaatactcg ttattcctat 
2281 gttagacgat atgcacctac tttattacgc agtcttcatt ttagagcgac gaaatctgga 
2341 gaacccgttt tacaagcact tgatacgatt catgaattga atgaaacagg taaacgaaaa 
2401 gtgcctcatg gcgctccctt acactttgta tctaaccgtt ggcaaaaaca tgtatatgat 
2461 gatgacggaa acattaatcg gcactattat gaattggcag ctttaaccga gttaagaaat 



2521 catattcgtt caggagatat ttttgtatca ggcagtagac atcataaggc atttgatgat 
2581 tatcttatac cgtatgatga atggaacgaa gtttcaaaca ttccaaatgg ccttacagcc 
2641 cctttaaaag ctgaagatta tataactgac cgaataaacc gattaaatga acatttagaa 
2701 tggttatcaa aaaatagcga gaaattagaa ggagtagata ttagtcaagg aaaactacat 
2761 gttgagcgat tggatagagg aactccagaa gaagcaaaag catttagtaa acttcttcat 
2821 agtatgttac ccagaataaa actgacagat ttactaatag aagtggccag ttggactgga 
2881* tttcatgatc aatttatcca cgcctctacc aatcaatctc ctgatcaaga ggaacagaat 
2941 attgtattag ctactttgat ggcaatgggg acaaatatag gtcttacaaa aatggcggaa 
3001 gctacccctg gaatctccta tagacaaatg gcaaatgctt cacaatggag aatgtatgat 
3061 gatgcaatgg ttcgcgctca atctattctc gtgaatttcc aaaaggaaca aaagttatca 
3121 tcctattggg gagatgggac aacttcttct tctgatggta tgcgtttatc cattgctgta 
3181 cgttctctac atgcagattc caatccacac tatggaacgg gaaaaggtgg aaccatttat 
3241 cgatttgtaa gtgatcagct atctgcctat catgtgaaag tcattactac caatgcacga 
3301 gatgctcttc atgtattaga tgggcttctt catcatgaaa cagatttaaa aattgaagaa 
3361 cattacacag atacagctgg atatacagat caagttttcg ctttaacaca cttgttggga 
3421 tttcgatttg ctcctcgtat tcgtgatttg gcagatacta agcttttttc tatacctggt 
3481 ggagaagaat atgaaaatgt tcaagcactt ttaaaaggaa aaattaatgt aaagttaata 
3541 aaagaaaact atgaagatat tagaaggttg gcatactcgg ttcaaacagg aaaagtatct 
3601 agcgctctta tcatggggaa gctcggatca tatgcaagac aaaataaact cgcaacagca 
3661 cttggagaaa tgggacgaat cgagaaaacc ctctttacgc tggattacat atctaataaa 
3721 gccgtaagga gacgggttca aaaaggttta aataaaggag aagcaatcaa tgcattagct 
3781 agaactatat tttttggaca acgtggagaa tttagagaac gtgctctcca agaccagtta 
3841 caaagagcta gtgcactaaa cataattatt aacgctataa gtgtgtggaa cactgtatat 
3901 atggaaaaag ccgtagaaga attaaaagca agaggagaat ttagagaaga tttaatgcca 
3961 tatgcgtggc cgttaggatg ggaacatatc aattttcttg gagaatacaa atttgaagga 
4021 ttacatgaca ctgggcaaat gaatttacgt cctttacgta taaaagagcc gttttattct 
4081 taatataacg gctcttttta tagaaaaaat ccttagcgtg gtttttttcc gaaatgctgg 
4141 cggtacccc 
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Commissioner for Patents ' ^ 
P.O. Box 1450 

Alexandria, VA 22313-1450 )^ 

Sir: ■ " 

I, Bernard Hallet, declare that: 

1. ] received a Ph.D. degree from The University Catholique de Louvain in 1993. 

2. I presently hold the position of Research and Professor Associate ac the Unite 
de Gen^tique of the Institut des Sciences de la Vie ac the Universite Catholique de Louvain. 
I have been a professor at the Universite. Catholique dc Louvain since 2001. 

3. My complete academic backgroimd, publications and professional experience 
are set forth in my Curriculum vitae, a copy of vy^hich is attached hereto as Exhibit A. 



4. I have authored or co-authored more than 18 scientific articles in the area of 
genetic engineering. 
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5, I am a co-invencor of che invencion claimed in U.S. Patent Application No> 
09/895,435. I have reviewed the contents of this patent application, che Examiner^s 
Office Action dated March 8, 2004, and che claims as amended in response to chat Office 
Action, I have reached certain conclusions regarding the amended claims based upon 
that review and my knowledge of the field of art and my experience. 

6, Amended claims 1, 13, 16 and 20 provide for the use of a "DNA molecule 
comprising one or more copies of TRT (SEQ ID NO:3) or a funcdonaJ variant thereof that 
is altered from SEQ ID N0:3 only in the central crossover region". A molecular biologist 
with knowledge of the tyrosine recombinase literature and U.S. Pacenc Application No. " 
09/895j435 would understand that functional variants altered from SEQ ID NO;3 only in 
che central crossover region would be substrates for Tnpl-mediated recombination. All 
changes in. the central region (between the TRT inverted repeats) should remain 
functional (not only chose that make the reconnbinadon site symmetrical). This 
understanding is based on th^ following discussion. 

7, In current mechanistic models for recombination mediated by tyrosine 
recombinases, DNA scrands are exchanged between the recombination partners by 
swapping and reannealing a few nucleotides from che central, or "crossover" region (6 co 
8 bp) that separates che cop and bottom strand, cleavage positions widiin the 
recombination core sites (1, for a review, see Ref 2), There is no need for specific 
contacts between the recombinase protein and the central sequence during strand 
exchange. 

8, As a consequence of this mechanism^ tyrosine recombinases can fnancdon with 

recombination sice variants containing altered crossover regions, provided that che 

idendty between the partners is preserved. This was demonstrated in early studies on 

model recombinases such lambda Inc and Cre (e.g., Ref. 3-5; reviewed in Ref. 6). 

However, changes in the central region was shown affect the overall efficiency of 
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recombinacion by influencing che cleavage reaction, rhe order of strand exchange, or the 
isomerization of the HoUiday junction intermediate (e.g.. Ref. 7-11). The ability to work 
on different pairs of recombination sites with a single recombinase is exploited by the 
Xer system to process dimeric forms of che bacterial chromosome and a variety of 
naturally occurring plasmids (For a review see Ref. 12). Early experiments also 
demonstrated that for simple recombination systems such as Flp/FRT or Cre/loxP, the 
orientation of the recombination site is wholly determined by the central region^ 
recombination between symmetrical sites giving rise to both inversion and deletion 
events (e.g,> ReF». 13-16). 

9. References cited in Paragraphs 7 and 8: 

1. Nunes-Duby:SE, Azaro M, and Landy A (1995) Curr. Biol. 5:139-148 V 

2. Van Duyne GD (2001) Aimu. Rev. Biophys. Biomol. Struct. 30:87-104. -v: 

3. Weisberg RA; Enquist LW, Foeler C and Landy A (1983) J, Mol. Biol. 17Gii319-342. 

4. DeMassy B, Studier F, Dorgai L, Appelbaum E and Weisberg RA (1984) .Cold Spring 
Harbor Symp.:(3uant, Biol. 49:715-726. 

5. Bauer C£, Gardner JF, Gumport RI (1985) J. Mol, Biol. 181:187-197. 

6. Landy A. (1989) Annu. Rev. Biochem. 58:913-949. 

7. Azaro MA and Landy A (1997) EMBO J. 16:3744-3755. 

8. Hallet B, Arciszewka LK. and Sherratt DJ (1999) MoL Cell 4:949-959, 

9. Arciszewska LK, Baker RA, Hallet B and Sherratt DJ (2000) J. MoL BioL. 299:391-^03- 

10. Lee L, and Sadowski, PD (2001) J. Biol. Chem. 276:31092-31098. 

11. Lee L, and Sadowski, PD (2003) J, MoK Biol. 326:397-412. 

12. Sherratt DJ, Arciszewska LK, Blakely G, Colloms S, Gram K, Leslie N and McCuIlogh 
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13. Andrews BJ, McLeod M, Broach JR and Sadowski PD (1986) MoL Cell BioL 6:2482- 
2489. 

14. Hoess RH, Wierzbicki A, and Ambreski K (1986) NucL Acids Res. 14:2287-2300. 

15. Senecoff JF, and Cox MM (1986) J. Biol. Chem. 261:7380-7386. 

Page 5 of 5 

27/5790 I 



1 



Declaration of Bernard Hallec under 37 C.F.R. 1.132 

16. Meyer-Leon L. Huang LC, Umlauf SW, Cox MM, and Inman RB (1988) Mol. Cell Biol. 
8:3784-3796. 

10. We have also confirmed that che claimed functional variants of SEQ ID N0:3 
are substrates for Tnpl-mediated recombination. To demonstrate this, functional 
variants of che TRT minimal core recombination site (SEQ ID N0:3) were constructed 
which included sequences in which the central 4-bp sequence between the inverted 
repeats was modified, In particular, Tnpl-mediated recombination of a DNA substrate 
carrying two TRT sites with a bidirectional symmetrical central region was 
demonstrated, giving rise co both inversion and deletion products (See FIG. 1), In 
'^.ddidon, changes in Tlie central region were found to influence che overall efficiency of 
recombination, with the TRT variant containing the 5'-ATAT-3' central region being more 
iefficient than che TRT variant with die 5'.TATA-3' central region (FIQ. 1). 

■ • 
IL The legend of FIG. 1 is as follows: Tnpl-mediatedvrecombination ac 
Symmetrical core sices in vitro. (A) Nucleotide sequence of the wild-type Cl-2) and. 
symmetrical (1.1 and 2.2) core recombinadbn sites. Changed in che 4-bp central region 
are shown in bold. Positions of Tnpl cleavages are indicated. (B) In vitro recombination 
activity was examined on supercoiled plasmids carrying two copies of the different 
versions of the TRT sequence. After phenol extraction and echanol precipitation, 
reactions were cleaved with Smal and run on a 0.8% agarose gel. Bands are identified 
as follows : SI and S2, substrate fragments; PI and P2, deletion products; II and 12, 
inversion products; Intl and Int2, product bands arising firom intennolecular 
recombination reactions between inversely oriented substrates chac cannot be seen with 
che wild type substrate; HJ, Holliday jtmcrion; Lin, linear form of the substrate, 

12. Based on che above review of the literature, a molecular biologist who was 

provided with the present patent application would have understood at the time of filing 

that the claimed functional variants would serve as substrates for Tnpl-mediaced 
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recombination, and that the inventors would have contemplated such variants. The 
experimental results provided herewith confirm the expectation that such functional 
variants would be substrates for Tnpl-mediated recombination. 

13. r hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information and belief are believed to be true; and 
further that these statements were made with the laiowledge that willful fake statements 
and the like so made are punishable by fine or imprisonment, or both, under Section 
1001 of Title 18 of. the United States Code and ^liarsuch wiUful false statements may 
jeopardize the validity of the application or a-ny patent issued thereon. 



Signed, 



Bernard Hallet 



OS /o^/^ooi/ 
Date 
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ETUDES UNIVERSIT AIRES 

1984-1988 Licence en Sciences - Section Biologie Moleculaire 



Universite Catholique de Louvain 

iere Candidature en Sc. Naturelles (Gr. B) (GD) 

2eme Candidature en Sciences (Biologie) (GD) 

iere Licence en Sciences (Zoologie) (GD) 

2enne Licence en Sciences (Zoologie) (LPGD) 

1988-1991 Dipldme d'Etudes Approfondies en Biologie 

Universite Catholique de Louvain (LPGD) 



1988-1993 Doctorat en Sciences - Biologie 

"Transposition et Mecanismes de Specificite de Cible d'IS237A, 
une Sequence d'Insertion de Bacillus thuringiensis" 
Universite Catholique de Louvain, 

Unite de Genetique, prom. prof. Jean Delcour (LPGD+) 
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EXPERIENCE POSTDOCTORALE ET FONCTIONS ACTUELLES 

1 0/93-1 0/94 Assistant de Recherches 

Universite Catholique de Louvain, Unite de Genetique prof. Jean Delcour 

• Proprietes de transposition des elements genetiques mobiles \S231A et 
Tn 4430 de Bacillus thuhngiensis. 

10/94-12/97 Sejour postdoctoral a Tetranger 

University of Oxford, UK, Dpt of Biochemistry, prof. David J. Sherratt 

• Mecanisme de recombinaison site-specifique Xer d' Escherichia coll. 

• Developpement d'une methode de mutagenese des proteines par insertion 
aleatoire de pentapeptides a partir du transposon Tn4430. 

01/98-10/98 Assistant de Recherches 
10/98-10/01 Charge de Recherches FNRS 

Universite Catholique de Louvain, Unite de Genetique, prof. Jean Delcour 

• Le systeme de recombinaison site-specifique Tnpl/irs et le mecanisme de 
transposition du transposon Tr\4430. 

• Controle genetique de la recombinaison site-specifique Cre/Lox chez 
Lactococcus lactis. 

• Participation aux autres projets du laboratoire sur la biologie moleculaire des 
bacteries lactiques (metabolisme, synthese de la parol, adaptation aux stress 
et regulation genetique). 

10/01- Chercheur Qualifie FNRS et Charge de Cours a Temps Partiel a TUCL 

10/02- Responsable de {'Unite de Genetique (GENE) 

BOURSES ET CONTRATS 

10/88-09/91 Bourse de these IRSIA 

10/91-10/94 Mandat d'Assistant de Recherches a I'UCL 
(candidature au FNRS reprise par le FDS) 
contrat interrompu pour la duree du stage a I'Universite d'Oxford 

1 1/94 Contrat de Recherche a I'Universite d'Oxford (MRC) 

1 2/94-1 1/96 Bourse postdoctorale EMBO 

12/96-1 1/97 Bourse postdoctorale des Communautes Europeennes (BIOTECH) 
12/97-09/98 Reprise du mandat d' Assistant de Recherches a TUCL (FDS) 
10/98-09/01 Mandat de Charge de Recherches FNRS 
10/01- Mandat de Chercheur Qualifie FNRS 

DOMAINES D'EXPERTISE 

Biologie generale et Moleculaire, Genetique, Microbiologie (Gram" et Gram") 
Conformation et topologie de I'ADN, structure des proteines, enzymologie 
Interactions proteines-ADN, assemblages nucleoproteiques complexes 
Recombinaison genetique et transposition (mecanismes et regulation) 
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ACTIVITES D'ENSEIGNEMENT (DEPUIS 1998) 

Participation aux "Exercices integres en Genetique Moleculaire" 

(UCL, Biol 2138, J. Delcour - M. Deghorain). 

"Modification de la relation structure-fonction de la p-lactamase TEM-1 d' E. co// par la methode 
de pentapeptidogenese", 2h-6h/an depuis 1997. 

Participation au cours de "Genetique Moleculaire" ^ 

(UCL, Biol 2137, J. Delcour). 

"Legons sur la replication et les mecanismes de recombinaison genetique", 2-4h/an de 1998-a 
2003. 

Participation au cours de "Biochimie Comparee" 

(UCL, Biol 2271, J. Delcour - R. Rezsohazy). 

"Constance et diversite dans les mecanismes permettant de couper, souder et synthetiser les 
acides nucleiques", 6h/an de 1999 a 2000. 

Charge de cours invite a I'atelier de DEA sur "la plasticite du genome bacterien" (Univ. P. 
Sabatier, Toulouse, C. Gutierrez). 

"mecanismes et fonctions biologiques de la recombinaison site-specifique: un pour tous, tous 
pour un", 4h, novembre 2000. 

Co-titulaire du cours de "Biologie moleculaire et cellulaire bacterienne" 

(UCL, Biol 2285, J. Delcour - B. Hallet). 

"La cellule bacterienne dans le temps", 15h/0h depuis 2001 . 

Co-titulaire du cours de "Genetique Moleculaire" ^ 

(UCL, Biol 2137, J. Delcour - B. Hallet). 
15h/0h depuis 2003. 

Co-titulaire du cours de "Genetique Microbienne" ^ 

(UCL, Biol 2211, A. M. Corbisier- B. Hallet). 
15h/15h depuis 2003. 

VULGARISATION ET PROMOTION DES SCIENCES 

Article didactique destine aux enseignants du secondaire 

"Initiation a la demarche biotechnologique" 
B. Hallet, (1992), Probio Revue, 15,147-172 

Ateliers destines aux eleves du secondaire 

"Du phenotype au gene", 4h/an entre 1990 et 1994. 

Conference au 37^"^^ Congres Pluraliste des Sciences, Namur, le 25/08/99 
"Les transgenoses naturelles et artificielles et leur impact sur revolution" 
Resp. J. Fraipont, President de I'Association des Professeurs de Biologie. 

Conseiller scientifique de Texposition "Gene-ethique" 

Pare d'Aventure Scientifique "Le PASS" de Frameries. 
Resp. C. Bluard, Museologue au "PASS". 

Accueil annuel d'etudiants de I'enseignement secondaire pour raccomplissement d'un 
stage de courte duree en entreprise (3 a 5 jours) 

(decret de la Communaute Frangaise de Belgique, depuis 2001-2002, 1 a 4 stagiaires par an). 
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ACTIVITES D'ENCADREMENT (DEPUIS 1998) 

PROMOTEUR DE MEMOIRES DE FIN D'ETUDES 

V. Vanhooff (1997-1998): Mise en oeuvre de la recombinaison site-specifique Cre-/oxchez 

Lactococcus lactis 

N. Michelet (1998-1999): Controle de la recombinaison site-specifique Cre-/ox 

Chez Lactococcus lactis 

C. Galloy (1999-2000): Construction et caracterisation de sites hybrides entre les 

systemes de recombinaison site-specifique Xer d' Escherichia 

coli et Tnpl//rs du transposon Tn4430 
N. Simonis (2000-2001): Construction d'un systeme de transposition controlee du 

transposon Tn4430de Bacillus thuringiensis 
M. Lambin (2001-2002): Caracterisation et purification d'une proteine de fusion entre la 

transposase de Jr\4430 et un motif polyhistidine 

D. Cochonneau (2003-2004) Construction d'un systeme hybride pour I'etude des 

determinants de Timmunite de transposition de Tn4430 

PARTICIPATION A L' ENCADREMENT DE MEMOIRES DE FIN D'ETUDES DANS LES AUTRES DOMAINES 
DE RECHERCHE DU LABORATOIRE 

C. Devoghel (1997-1998): Construction d'une souche de Lactobacillus helveticus deficiente 

en D-lactate deshydrogenase 

E. Palumbo (1997-1998): Construction et caracterisation d'un mutant de L. plantarum 

deficient pour I'alanine racemase 
I. Focant (1998-1999): Purification et caracterisation de L-lactate deshydrogenase 

sauvage et mutantes de Streptococcus thermophilus 
S. Burteau (1998-1999): Construction d'une souche de Lactobacillus helveticus deficiente 

en D-lactate deshydrogenase et production d'alanine 
C. Verheust (1998-1999): Construction et caracterisation d'un mutant de Lactobacillus 

plantarum deficient pour la glutamate racemase racemase 
L. Fontaine (1999-2000): L'expression chez L. plantarum du gene ddl de L. lactis subsp. 

cremoris entraine la sensibilite a la vancomycine et modifie les 

precurseurs du peptidoglycane 
8. Lewahert (1999-2000): Construction et caracterisation de deux souches mutantes AltA- 

et AltB- de Lactococcus lactis affectees dans le transport de 

I'alanine 

N. Delalune (2000-2001): Etude des voies de reorientation du pyruvate chez 

Streptococcus thermophilus 
W. Glenison (2000-2001): Etude de la glycosylation des acides teichoiques de parois chez 

Lactobacillus plantarum 

A. Grosjean (2000-2001): Construction de mutants de Lactococcus lactis deficients pour le 

transport de I'alanine 

V. Wouters (2000-2001): Etude de la fonction de la proteine de regulation RelAI dans la 

reponse a un choc froid chez Streptococcus thermophilus 
J-B. Beaudry (2001-2002): Effets physiologiques de la surexpression des genes de 

pyruvate oxydase de Lactobacillus plantarum 
I. Tytgat (2003-2004): Importance du D-lactate dans la synthese de 

peptidoglycane de Lactobacillus plantarum 
A. Stulkens (2003-2004): Etude de la fonction physiologique des pyruvate oxydases de 

Lactobacillus plantarum par surexpression et disruption de 

genes poxD et poxF 

E. Verplaets (2003-2004): Etude du regulateur BUR du systeme a deux composantes 

BIIH/BIIR de Streptococcus thermophilus 
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PROMOTEUR DE THESES DE DOCTORAT 

V. Vanhooff, boursiere FRIA, depuis sept. (1998-2003) 

Le systeme de recombinaision site-specifique du transposon Jn4430 

(these soutenue le 25/06/2003) 

C. Galloy, 3^""^ annee FRIA, depuis sept, 2000: 

Interactions proteine-proteines et proteines-ADN impliquees dans I'assennblage du complexe 
de recombinaison site-specifique Tnpl//rs du transposon Tn4430 

M. Lambin, 2^"^^ annee FRIA, depuis sept. 2002: 

Etude de la relation structure-fonction de la transposase du transposon replicatif Tn4430 
CO-PROMOTEUR DE THESES DE DOCTORAT 

8. Leidgens, 1^'^ annee FRIA, depuis sept 2003: 

Etude du role de la frataxine dans la biosynthese des centres fer-soufre et de son interaction 
avec la proteine lsu1P chez la levure Saccharomyces cerevisiae 
Promoteur F. Foury (Unite FYSA, UCL) 

PARTICIPATION A L'ENCADREMENT DE THESES DE DOCTORAT DANS LES AUTRES DOMAINES DE 
RECHERCHE DU LABORATOIRE 

8. Derzelle, boursiere FRIA, 1995-2000: 

Etude genetique et physiologique de trois genes csp de reponse au choc froid chez 
Lactobacillus plantarum 

F. Lorquet, boursiere FRIA, 1996-2002: 

Etude genetique et fonctionnelle des genes de la pyruvate oxydase chez Lactobacillus 
plantarum 

M. Deghorain, assistante UCL, depuis sept. 1997: 

Alteration de la synthese du peptidoglycane de la parol chez Lactobacillus plantarum 
P. Goffin, boursier FRIA, depuis sept. 1997: 

Approche genetique du role physiologique de la racemisation du lactate chez Lactobacillus 
plantarum 

E. Palumbo, boursiere FRIA, depuis sept. 1998: 

Etude genetique de la fonction des substitutions en D-alanine et en glucose des acides 
teichoiques chez Lactobacillus plantarum 

L. Fontaine, 2^""® annee FRIA, depuis sept. 2001 : 

Etude d'un systeme de regulation a deux composantes chez Streptococcus thermophilus 
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PARTICIPATION AU COMMITE D'ENCADREMENT DE THESES DE DOCTORAT EFFECTUEES DANS 
D'AUTRES LABORATOIRES 

V. Belis, depuis 2002: 

Developpement d'une strategie de selection pour revolution dirigee d*enzymes 
Promoteur ; P. Soumillion, Unite BlOP, UCL 

J, Crouzet, depuis 2002: 

Analyse structurelle et fonctionnelle des transporteurs ABC chez la plante 
Promoteur : M. Boutry, Unite FYSA, UCL 

F. Etienne, depuis 2002: 

Etude de la regulation de Tactivite de la peroxyredoxine 5 et de son implication dans la 
transduction de signaux intracellulaires 
Promoteur : B. Knoops, Unite BANI, UCL 

D. Vanham, depuis 2002: 

Etude du promoteur de transcription de NpABCI, un gene de Nicotiana plumbaginifolia codant 
pour un transporteur ABC de la membrane plasmique et dont I'expression est induite par le 
sclareol 

Promoteur : M. Boutry, Unite FYSA, UCL 
K. Bobik, depuis 2003: 

Etude de la regulation de TATPase-pompe a propton par phosphorylation 
Promoteur : M. Boutry, Unite FYSA, UCL 

J. Deherve, depuis 2003 

Creation d'une activite phosphatase de novo par evolution dirigee 
Promoteurs : J. Fastrez, P. Soumillion, Unite BlOP, UCL 

S. De Simoni, depuis 2003: 

Apoptose et necrose induites par un stress oxydant mitochondrial : etude de la fonction 
cytoprotectrice de peroxyredoxine 5 dans la lignee humaine SH-SY5Y 
Promoteur : B. Knoops, Unite BANI, UCL 

C. Labarbe, depuis 2003 : 

Evolution dirigee d'une proteine « PBP like » en beta-lactamase de class A 
Promoteurs : J. Fastrez, P. Soumillion, Unite BlOP. UCL 
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COMMITES DE LECTURE (DEPUIS 1998) 



Lecteur de 19 memoires de fin d'etudes et de DEA. 



Examinateur de 8 theses de doctorat, dont 5 a Tetranger, et de 2 theses d'Habilitation a diriger 
des recherches (HDR) 

Evaluation d'articles soumis a plusieurs revues scientifiques (EMBO J., Mol. Microbiol., Enz. 
Microbiol. Technol., Res. Microbiol.) 



COMPOSITION ACTUELLE DE L'EQUIPE RECHERCHE (2002-2003) 



Les projets de recherche orientes sur "les mecanismes moleculaires de transposition et de 
recombinaison site-specif ique" de I'Unite de Genetique (2 academique et 14 scientifiques, 
au total) occupent actuellement 5 chercheurs: 



1 chercheur qualifie: 

1 technicienne : 

2 etudiants en these de doctorat 

1 etudiante en memoire de licence 



Bernard Hallet (FNRS) 
Sylvie Rossenfosse (UCL, 75%) 
Christine Galloy (4^""^ annee FRIA) 
Michael Lambin (2^'^ annee FRIA) 
Daphne Cochonneau 



FINANCEMENT DE LA RECHERCHE (DEPUIS 1998) 



10/98-09/00 



10/00-09/01 



09/01-10/06 



10/03-10/04 



10/04-10/08 



Fonds speciaux de recherche (FSR, UCL): 

24 789 a (fonctionnement) 

"Le sytsteme de recombinaison site-specifique du transposon Tn4430: etude 

du mecanisme et applications" 

Credit aux Chercheurs # 1.5.177,01 (FNRS): 

19 831 o (fonctionnement) 

"Les interactions moleculaires impliquees dans I'assemblage du complexe de 
recombinaison et le controle de la reaction d'echange des brins d'ADN 
catalysee par le systeme de recombinaison site-specifique Tnpl//rs du 
transposon Tn 4430" 

Action de Recherche Concertee # 01/06-268 (ARC 2000, UCL, 
Communaute Frangaise de Belgique): 

247 793a sur un total de 743 453a (personnel, equipement et fonctionnement) 
"Accelerated molecular evolution of enzymes" en partenariat avec les 
laboratoires de J. Fastrez, P. Soumillion et J.-P. Declercq (BIOP/CSTR, UCL) 
Credit aux Chercheurs # 1.5.190.03 (FNRS): 
15 000 a (equipement) 

"Les mecanismes moleculaires de la transposition replicative et Timmunite de 
transposition du transposon bacterien Tn4430: une etude genetique et 
biochimique" 

Fonds de la recherche fondamentale collective # 2.5496.04 (FNRS): 

60 GOQa (equipement et fonctionnement) 

«The molecular transposition and site-specific recombination machines of 
Tn4430 » 
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PRINCIPALES COLLABORATIONS SCIENTIFIQUES 
Dr D. Lereclus: Institut Pasteur, Paris (depuis 1991) 

Etude des elements genetiques mobiles \S231 et Tn4430de Bacillus thuringiensis 

(un article en preparation) 
Dr D. J. Sherratt: University of Oxford (depuis 1994) 

Etude des mecanismes de recombinaison site-specifique 

(stage postdoctoral, 5 articles publies en commun) 
Dr F. Hayes, University of Manchester (depuis 1994) 

Developpement et utilisation d'une methode de mutagenese par insertion aleatoire de peptides 

basee sur le transposon Tn4430 ("Pentapeptide Scanning Mutagenesis") 

(4 articles publies en commun) 
Drs M. Galleni et J-M Frere, Centre d'lngenierie des Proteines, UIg (depuis 1997) 

Utilisation de mutants d'insertion dans la beta-lactamase TEM-1 pour I'expression et la 

production d'epitopes vaccinaux 
Dr J-J Letesson, Unite de Recherclie en Biologie Moleculaire, FUNDP (depuis 1998) 

Etude de la topologie de la porine OMP2B par insertion aleatoire de peptides ("Pentapeptide 

Scanning Mutagenesis") 
Dr R. Rezsohazy, Genetique du Developpement, UCL (depuis 1998) 

Etude de la relation structure-fonction de la proteine homeotique Hoxa-1 par mutagenese 

insertionnelle ("Pentapeptide Scanning Mutagenesis") 
Dr A. F. Stewart, ElVIBL, Heidelberg / Max Planck institute, Dresden (depuis 1998) 

Developpement et utilisation de la recombinaison site-specifique TnpUirs chez les mammiferes 

(un brevet depose en commun) 
Dr S. D. Colloms, University of Glasgow, Glasgow (depuis 1998) 

Echange des mecanismes de controle entre la recombinaison Xer 6' Escherichia collet le 

systeme Tnpl/IRS de ln4430 (un article en preparation) 
Dr A. M. Segall, State University of California, San Diego (depuis 2001) 

Utilisation de peptides inhibiteurs des topoisomerases et recombinases site-specifiques 

(un article en preparation) 
Dr B. Revet, Institut Gustave Roussy, Paris (depuis 2001) 

Analyse des complexes nucleoproteiques par micrsoscopie electronique 

(un article en preparation) 
Dr P, Soumillion et Prof. J, Fastrez, Unite BlOP, UCL; Prof, J-P Declercq, Unite CSTR, UCL 

(depuis 2001) 

Evolution moleculaire acceleree: developpement de nouvelles approches basees sur la 

recombinase Tnpl de Tn4430 

(Action de Recherche Concertee, ARC) 

COMMUNICATIONS SCIENTIFIQUES 

18 publications dans des revues Internationales avec comite de selection (liste ci-apres) 

1 chapitre de livre 

3 articles en preparation 

1 brevet d' invention 

34 communications a des congres a Tetranger, dont 5 exposes en seance pleniere 
28 communications a des congres en Belgique 
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LISTE BIBLIOGRAPHIQUE 
Chapitre de livre 

DNA site-specific resolution systems, 

Hallet B., V. Vanhooff, and F. Cornet. (2004), p. 145-180 in G. Phillips and B. 
Funnell (Eds.) The biology of plasmids. ASM Press, Washington, D. C. 

Publications (revues internationales) IF 2002 

Innproved adaptation to cold-shock, sationary-phase, or freezing stresses in 
Lactobacillus plantarum overproducing cold-shock proteins, 
Derzelle S., B. Hallet, T. Ferain, J. Delcour and P. Hols (2003) 

Appl. Environ. Microbiol., 69,4285-4290. 3.691 

Cold shock induction of the cspL gene of Lactobacillus plantarum involves 
transcriptional regulation, 

Derzelle S., B. Hallet, T. Ferain, J. Delcour, and P. Hols (2002), 

J. Bacteriol., 184, 5518-5523. 3.959 

Playing 'Dr Jekyll & Mr Hyde': combined mechanisms of phase variation in bacteria, 
Hallet B., (2001), 

Curr. Opin. Microbiol., 4, 570-581. 6.430 

Pentapeptide scanning mutagenesis: encouraging old proteins to execute unusual tricks, 
Hayes F., .and B. Hallet, (2000), 

Trends Microbiol., 8, 569-574. 6.665 

Changes in cspL, cspP, and cspC mRNA abundance as a function of cold shock and 
growth phase in Lactobacillus plantarum, 

Derzelle S., B. Hallet, K. P. Francis, T. Ferain, J. Delcour, and P. Hols, (2000), 

J. Bacteriol. 182, 5105-5113. 3.959 

Coupled catalysis in a recombination machine: Mutated XerC and XerD recombinases 
that stimulate strand exchange by their partner recombinase, 
Arciszewska L. K., R. Baker, B. Hallet, and D. J. Sherratt, (2000), 

J. Mol. Biol. 299, 391-403. 5.359 

Reciprocal control of catalysis by the tyrosine recombinases XerC and XerD: an 

enzymatic switch in site-specific recombination, 

Hallet B., L. K. Arciszewska, and D. J. Sherratt, (1999), 

Mol. Cell 4,949-959. 16.471 

Structure-Function Correlations in the XerD Site-Specific Recombinase Revealed by 

Pentapeptide Scanning Mutagenesis, 

Cao Y., B. Hallet, D. J. Sherratt, and F. Hayes, (1997), 

J. Mol. Biol. 274.39-53. 5.359 

Transposition and site-specific recombination: adapting DNA cut and paste 
mechanisms to a variety of genetic rearrangements, 
Hallet B., and D. J. Sherratt, (1997), 

FEMS Microbiol. Rev. 2L 157-178. 9.597 

Insertion mutagenesis as a tool in the modification of protein function: extended 
substrate specificity conferred by pentapeptide insertions in the £2 loop of TEM-1 
P-lactamase, 

Hayes F., B. Hallet, and Y. H. Cao, (1997), 

J. Biol. Chem. 272^28833-28836. 6.696 
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Xer recombination in Escherichia coli. Site-specific DNA topoisomerase activity of the 

XerC and XerD recombinases, 

Cornet F., B. Hallet, and D. J. Sherratt, (1997), 

J. Biol. Chem. 272. 21927-21931. 6.696 

Pentapeptide scanning mutagenesis: random insertion of a variable five amino acid 

cassette in a target protein, 

Hallet B., D. J. Sherratt, and F. Hayes, (1997), 

Nucleic Acids Res. 25, 1866-1867. 7.051 

\S231A insertion specificity: consensus sequence and DNA bending at the target site, 
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■ Abstract Structural models of site-specific recombinases from the lambda inte- 
grase family of enzymes have in the last four years provided an important new perspec- 
tive on the three-dimensional nature of the recombination pathway. Members of this 
family, which include the bacteriophage PI Cre recombinase, bacteriophage lambda 
integrase, the yeast Flp recombinase, and the bacterial XerCD recombinases, ex- 
change strands between DNA substrates in a stepwise process. One pair of strands is 
exchanged to form a Holliday junction intermediate, and the second pair of strands is 
exchanged during resolution of the junction to products. Crystal structures of reaction 
intermediates in the Cre-toP site-specific recombination system, together with recent 
biochemical studies in the field, support a "strand swapping" model for recombination 
that does not require branch migration of the Holliday junction intermediate in order 
to test homology between recombining sites. 



introduction 

Site-specific recombinases from the lambda integrase family of enzymes catalyze 
DNA rearrangements that are critical for a variety of important biological func- 
tions. Perhaps the best known example is the integration and excision of the bac- 
teriophage k genome into and out of the Escherichia coli host chromosome (9). 
Other functions include the resolution of multimeric plasmids and chromosomes 
to monomers in order to ensure faithful segregation upon cell division, the am- 
plification of yeast 2/i circle copy number, and the regulation of gene expres- 
sion (reviewed in 24,44,57,59). The A-integrase family, also referred to as the 
"tyrosine recombinases" (58), includes over 100 members identified on the basis 
of sequence similarity (47). The most well-studied examples include, in addi- 
tion to the integrase protein from bacteriophage k (34), the bacterial XerC and 
XerD recombinases (57), Cre recombinase from bacteriophage PI (1), and the Flp 
recombinase from Saccharomyces cerevisiae (53). 
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The tyrosine recombinases carry out the site-specific recombination reaction in 
a stepwise manner. One pair of DNA strands is first exchanged to form a Holliday 
junction intermediate, and then the HolHday junction is converted to recombinant 
products by exchange of the second pair of DNA strands. In contrast to the many 
years of biochemical and molecular genetic studies on the various integrase family 
members by several laboratories, structural models for the tyrosine recombinases 
have only recently become available, beginning with the report of the X-integrase 
catalytic domain crystal structure in early 1997 (33). Since then, structures of 
the HPl integrase catalytic domain (27), the XerD protein (62), and three Cre re- 
combinase/DNA complexes (17,22,23) have been described. Most recently, the 
structure of an Flp recombinase/DNA complex has been determined (11a). The re- 
sulting structural models for the tyrosine recombinase family have provided both a 
view of the enzyme active sites and a three-dimensional perspective on the reaction 
pathway (for minireviews of the structural work, see references 20, 36, 40, 58, 68). 
This review focuses on the three-dimensional nature of the Cvt-loxP site-specific 
recombination pathway, where models of the recombinase-DNA complexes for 
three intermediates in the reaction are available. 

THE Cxe-loxP SYSTEM 

Cre recombinase is a 38-kDa protein encoded by bacteriophage PI . Its roles in the 
PI life cycle are thought to include cyclization of the linear genome and resolution 
of dimeric chromosomes formed following DNA replication as a result of homol- 
ogous recombination (61). The DNA sequences where recombinase binding and 
strand exchange take place are named loxP in the Cre recombinase system. These 
sites are composed of two recombinase binding elements (RBEs) arranged as 
nearly perfect inverted repeats surrounding a central strand exchange or crossover 
region (Figure 1). Two recombinase subunits bind to each core site (one to each 




5 ' -ATAACTTCGTATAGCATACATTATACGAAGTTAT-3 ' 
3 • -TATTGAAGCATATCGTATGTAATATGCTTCAATA~5 ' 
^^^^^~RBE™™™"i^^ crossover I ^ RBE 

cleavage 

Figure 1 The 34-bp loxP site is composed of two 14 bp recombinase binding elements (RBEs) 
arranged as inverted repeats around a central 6-bp crossover region (28). Cleavage of the sites 
occurs at the borders between the crossover region (in boldface) and the RBEs. Two Cre recombi- 
nase subunits bind cooperatively to the loxP site (one to each RBE) with no direct contact between 
the recombinases and bases in the crossover region. 
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RBE) with a high level of cooperativity (7,51). The phosphoryl transfer strand 
exchange chemistry between substrates occurs within the 6-bp crossover region. 
The crossover sequence is asymmetric and therefore provides directionality to the 
site. For the simplest systems, exemplified by Cre recombinase, this core recom- 
bination site is sufficient for the complete recombination reaction (3,53). For 
more complex systems, such as A-integrase and the XerC/D recombinases acting 
on plasmid substrates, the sites required for recombination contain additional se- 
quences recognized by accessory proteins and auxiliary DNA-binding domains. 
The simplicity of the Cre recombinase system has led to its widespread use as a 
tool in the manipulation of DNA molecules both in vivo and in vitro (54). 

The tyrosine recombinases use a topoisomerase 1-like mechanism to cleave and 
religate DNA strands during recombination. An historical model for the recombi- 
nation pathway is shown in Figure 2, which illustrates the stepwise nature of the 
reaction. This view of the reaction involves only the core recombination sites and 
the core-binding components of the recombinase enzymes. For simple systems 
(such as Cre) recombining linear substrates, this model represents all of the inter- 
acting components. For more complex systems, there are additional factors that 
play architectural and/or energetic roles that could influence the various steps in 
the recombination pathway. For reviews of the X-integrase family recombination 
mechanism as discussed prior to 1995, see references (13, 34, 35,44,59). 

In the mechanism shown in Figure 2, two Cre-bound loxP sites associate 
to form a recombination synapse. Two of the four recombinase subunits in the 
synapse cleave the DNA substrates using conserved tyrosine side chains as nucle- 
ophiles, forming covalent 3'-phosphotyrosine linkages to DNA and releasing free 
5'-hydroxyl groups. Either the 5'-hydroxyl groups can reseal the nicks to restore 
the original substrates and complete one round of topoisomerase-I-like cleavage 
and religation, or the 5'-hydroxyl groups can attack the 3'-phosphotyrosine link- 
ages of the partner substrates, resulting in the exchange of one pair of strands 
and formation of a Holliday junction (HJ) intermediate. After branch migration 
through the crossover region, the second pair of strands in the HJ intermediate are 
then cleaved and exchanged by the second pair of recombinase subunits to form 
recombinant products. 

The tyrosine recombinases display a strong requirement for sequence identity 
in the crossover region between recombining sites. Early models have explained 
how each site senses homology with its partner by requiring the HJ intermediate to 
branch migrate through this region (66). Mismatches in sequence would provide 
energetic barriers to this process and ensure that only identical crossover regions 
could be efficiently recombined. The branch migration model is satisfying in terms 
of explaining the requirements for homology and the observed reversibility of the 
recombination pathway, but it is difficult to reconcile with the motions required of 
the recombinase subunits and DNA arms of the HJ in three dimensions (60). In or- 
der to branch migrate 6-8 bp, the recombinase-bound arms of the HJ intermediate 
would need to undergo extensive rotations, requiring that protein -protein inter- 
faces formed in the synapse be disrupted. In the mid 1990s, Landy and coworkers 
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Figure 2 The branch migration model for integrase family site-specific recombination (for a 
review of this mechanism, see reference 13). Two recombinase-bound sites associate to form a 
recombination synapse (top left). Two subunits cleave the DNA substrates with conserved tyrosine 
side chains to form 3'-phosphotyrosine linkages and release free 5'-hydroxyl groups (middle left). 
The 5'-hydroxyl groups undergo intermolecular attack of the partner phosphytyrosine to complete 
the exchange of one pair of DNA strands between the two substrates and form a Holliday junction 
intermediate (lower left). The branch point of the junction starts at the site of initial strand exchange 
and then migrates through the crossover region to the second set of cleavage sites. The second 
pair of subunits is then activated and/or positioned for cleavage of the bottom substrate strands, 
which are exchanged to form recombinant products. Heterology between crossover sequences 
would block efficient branch migration and prevent the reaction from proceeding to the second 
strand exchange. For simplicity, the DNA sites are shown associating in a parallel orientation in 
this figure. The same mechanism can also be described with an antiparallel alignment of sites 
(see reference 4). 



proposed an alternative "strand swapping- isomerization" model for integrase fam- 
ily site-specific recombination (45) that was supported by experimental results 
from a number of laboratories reported at nearly the same time (4, 8, 37, 69). In 
this mechanism, strand exchange occurs following cleavage of the site by melting 
2-3 bases from their complementary strand and annealing to the corresponding 
complementary strand in the recombining partner. Those bases are then effectively 
tested for homology with the partner by Watson-Crick base-pairing and subsequent 
ligation. As discussed in this review, the Cre-DNA structural models strongly 
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support the strand swapping, but not the branch migration model for integrase 
family site-specific recombination. 

THE STRUCTURE OF Cre RECOMBINASE 

Before three-dimensional structural data became available, most of what was 
known about protein structures in the tyrosine recombinase family came from 
sequence comparisons (2,6) and limited proteolysis experiments (29,48). The 
tyrosine recombinases share limited sequence similarity overall, although it is 
now clear that family members have a common catalytic domain fold (15,47). 
Early sequence analyses revealed four strictly conserved amino acids required 
for catalysis: the Arg-His-Arg catalytic triad and the tyrosine nucleophile (2,6). 
All four residues were found to be in the C-terminal halves of the recombinase 
sequences, which show higher sequence similarity than do the more divergent 
N-terminal halves. As illustrated in Figure 3, Cre is a two-domain protein with a 
helical N-terminal domain and a larger, primarily helical C-terminal domain. 

The complexes of Cre recombinase with DNA substrates revealed an exten- 
sive protein-DNA interface between each protein subunit and its contacted RBE 
( 1 7, 22, 23). The two domains of Cre form a C-shaped clamp that grasps the DNA 
from opposite sides, where the N-terminal domain interacts primarily with the 
major groove proximal to the crossover region on one face, and the C-terminal 
domain interacts with successive minor, major, and minor grooves on the op- 
posite face (Figure 3a). A similar organization has been observed in the Flp- 
DNA complex (I la) despite clear differences in the structures of the proteins 
N-terminal to the conserved core catalytic domains. Biochemical evidence in the 
Xer and >.-int systems, together with the available structural models, indicates that 
this recombinase-DNA organization is likely to be a common feature of the tyrosine 
recombinases (26, 63). 

In addition to the extensive protein-DNA interface formed between Cre and its 
recombination site loxPj a substantial protein- protein interface is formed between 
the two subunits bound to RBEs on the same site. On one side of the DNA sub- 
strate, the two N-terminal domains interact with one another primarily through 
helix-helix contacts, but they do not interact substantially with the catalytic do- 
mains (Figure 4a). On the opposite side, the two catalytic domains interact via the 
exchange of helices located at the carboxyl termini (Figure 4b). The C-terminal 
helix in one Cre subunit (helix-N) buries its hydrophobic surface in an acceptor 
pocket on the adjacent subunit. The network of contacts between N- and C-terminal 
domains in DNA-bound Cre subunits may in part explain the large cooperativity 
of binding to the loxP site (1,51). 

The C-terminal helix exchange is not reciprocal between Cre subunits bound 
to the same loxP site in the Cre-DNA crystal structures because a second Cve-lox 
complex interacts to form a synaptic assembly and the helix exchange is cyclic 
among the four subunits (Figure 4b). In the HP 1 -integrase catalytic domain struc- 



92 VAN DUYNE 



ture, however, a C-terminal helix swap occurs in a reciprocal fashion, leading 
to an HPI-integrase dimer (27), An HP 1 -like reciprocal exchange of C-terminal 
helices between DNA-bound Cre subunits appears to be precluded for geometric 
reasons if the loxP site is bent even slightly. Modeling exercises indicate that even 
in the case of an unbent loxP site, mutual exchange of C-terminal helices would 
be difficult and may require partial unfolding or repositioning of the penultimate 
helix (M) in order to form the correct helix-acceptor interactions (21). 

The helix-swapping arrangement observed in the Cre-DNA structures not only 
provides a model for understanding cooperative binding, bending, and synapsis 
of loxP sites, but it also provides a provocative mechanistic argument for control 
of the cleavage reaction. The conserved tyrosine nucleophile (Tyr324) is located 
close to the peptide linker (Arg326-Gly333) that connects helix-N to the rest of 
the domain. Changes in quaternary structure of the recombination complex could 
therefore be sensed by the tyrosine nucleophile through this peptide linker, provid- 
ing a stereochemical coupling between the overall structure of the recombination 
assembly and the positioning of a critical catalytic amino acid. 

SYNAPSIS AND STRAND EXCHANGE 

In order for recombination to occur, two DNA sites must associate to form a 
synaptic complex, within which the cleavage and strand exchange reactions take 
place. In the structures of two Cre mutants bound to symmetrized loxP sites 
(22), the Cre-bound DNA duplexes are bent sharply within the crossover regions 
and are brought together with all four arms (defined as the DNA segments on 
either sides of the bends) lying nearly in the same plane (Figure 4). The four 
recombinase monomers create a pseudo fourfold-symmetric network of protein- 
protein interactions responsible for holding the synapse together, with a set of 
interactions between the N-terminal domains on one side of the DNA plane and an 
independent set of interactions between C-terminal domains on the opposite side 
of the DNA substrate plane. The complex is strictly twofold symmetric overall, 
with the dyad of symmetry passing through the center of the complex in a direction 
perpendicular to the plane of the DNA substrates. A similar twofold-symmetric 
arrangement of recombinase and substrate is present in the structure of the cleaved 
covalent Cre-DNA intermediate (23) and in the structures of the Cre- and Flp-bound 
Holliday junction intermediates (11a, 17). Although the DNA sites in these cases 
are symmetric and therefore have no sequence-derived directionality that would 
specify parallel versus antiparallel alignment, the twofold-symmetric structures 
strongly imply an antiparallel alignment of wild-type recombination sites. The 
three-dimensional mechanism discussed for the Cre-loxP pathway (18, 22) in fact 
requires that the sites be aligned in an antiparallel fashion in order for productive 
strand exchange to occur in the absence of large structural rearrangements. 

The nature of the protein-protein contacts in the Cre-DNA synaptic complexes, 
and more recently in the Flp-HJ complex, have provided a great deal of insight into 
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many aspects of the recombination process. A striking feature of these structures 
is the types and the extents of the protein-protein interfaces. In the Cre tetramer, 
the self-association of N-terminal domains alone buries ~4000 of solvent ac- 
cessible surface area, with the interactions between C-terminal domains burying 
a further ~7000 A^. The N-terminal domains primarily interact with one another 
(and with the DNA) to form a nearly fourfold-symmetric tetramer via helix-helix 
and loop-helix interactions. As a consequence of this symmetry, the interactions 
between two N-terminal domains bound to the same loxP site are nearly identical 
to the synaptic interactions between N-terminal domains on different sites. This 
arrangement of N-terminal domains is capable of accommodating the synapsed 
substrates, the cleaved substrates, and the HoUiday junction intermediate in 
both halves of the pathway with virtually no changes in quaternary structure. The 
N-terminal domains therefore appear to form a relatively rigid structure that per- 
sists throughout the reaction. 

Some of the most informative insights from structural models in the tyrosine 
recombinase system concern the geometry of the cleavage and strand exchange 
steps in the reaction. Our first snapshot of this process in action came from the crys- 
tal structure of Cre recombinase covalently bound to a suicide DNA substrate (23). 
This intermediate was formed using symmetrized loxP sites that contained nicks 
adjacent to the scissile phosphates. Cleavage by the recombinase proteins resulted 
in formation of 3'-phosphotyrosine linkages to the DNA and release of free cytidine 
molecules, which diffuse away from the active sites. The resulting complex was 
trapped at the strand exchange step because the departing cytidine takes with it the 
5'-hydroxyl required to either continue or reverse the strand exchange reaction, 
and the remaining strand is too short on the 5'-end to reach either of the activated 
phosphotyrosines. 

The covalent Cre-DNA complex revealed a synaptic assembly with architec- 
ture nearly identical to that later observed in the precleavage synaptic complex 
(Figure 5). The recombinase proteins form a pseudo-fourfold-symmetric tetramer 
with the DNA duplex substrates in a distorted square planar arrangement. Two of 
the four protein subunits diagonally related in the complex have cleaved the DNA 
to form covalent bonds to the scissile phosphate, but the other two subunits have 
not cleaved the DNA. Remarkably, the 5'-ends of the DNA strands that would 
be transferred to the partner substrates in the normal reaction (if the 5'-cytidines 
were present) have partially melted away from their complementary strands, with 
their 5'-thymidine ends converging in the middle of the synaptic complex (see 
Figure 5 b). 

It is tempting to think of this cleaved intermediate structure as a snapshot of 
the strand exchange process in action. In this context, the complex structure could 
equally well represent a trapped strand transfer reaction in the HJ-forming step or 
in the HJ-resolving step. One of the interesting questions concerning this view of 
the structural model is whether the observed protein-DNA architecture needs to 
undergo a significant change in quaternary structure in order to accommodate the 
transfer and ligation of DNA strands between opposing halves of the recombination 
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Figure 6 Isomerization model for the HJ intermediate in Crc-loxP site-specific recombination. 
The conformer on the left differs from that on the right by an exchange in the interarm angles and 
an exchange in the positions of the branch-point phosphates. The stereochemical identities of the 
dark (crossing) strands on the left are identical to those of the light (crossing) strands on the right. 
Likewise, the light strands (continuous) on the left are equivalent to the dark (continuous) strands 
on the right. The strands labeled "crossing" are activated for cleavage and exchange in the two 
conformers. 

assembly. Modeling exercises suggest that the intact strands (with the 5'-cytidine 
restored) would be able to reach the complementary strands of the partner sub- 
strate and form at least two Watson-Crick base pairs immediately adjacent to the 
scissile phosphate. This annealing process could occur by moving only the three 
nucleotides at the 5'-ends of the DNA strands and leaving the remainder of the 
protein-DNA assembly unchanged. With more extensive movement of the DNA 
strands, a third base pair (and thus, a fully base-paired HJ intermediate) could be 
formed. The structure of the Cre-HJ intermediate (discussed below) indicates that 
three bases are exchanged between duplex substrates, placing the branch point 
of the HJ formed at the center of the loxP site. This implies that exactly half of 
the 6-bp crossover site between scissile phosphates are exchanged in forming the 
HJ (Figure 6) and the remaining half are exchanged in the resolution of the HJ 
intermediate to products. 



In principle, either of the DNA strands in the two duplex substrates could be cleaved 
and exchanged first in the reaction pathway. From a structural point of view, it is 
useful to consider whether there are differences between the DNA strands and/or 
the active sites in the precleavage synaptic complex that would favor the initial 
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cleavage and exchange of one strand over the other. In the Cre-DNA synapse 
formed with inactive Cre mutants and symmetrized loxP sites (22), there is a clear 
distinction between the two strands. The strand that corresponds to the one caught 
in the act of swapping has a heiicoidai trajectory that takes it through the middle 
of the open strand exchange region between duplex substrates. This strand is also 
relatively free from recombinase-backbone contacts that might limit its mobility. 
In contrast, the complementary strand is embedded in a more extensive protein 
interface, which is primarily composed of sugar and phosphate contacts located 
on the side of the DNA duplex pointing away from the strand exchange region. 
If this strand were to be exchanged first, a more complicated unraveling from the 
duplex and dissociation from recombinase interactions would be required. 

These observations from the precleavage Cre-DNA complex are consistent with 
the DNA strand that was caught in the act of exchange in the covalent intermediate 
structure. Together they suggest that the order of strand exchanges in the recom- 
bination pathway is determined once the synaptic complex is formed. However, 
two distinct antiparallel (and two distinct parallel) synapses are possible, which 
differ by the direction of bending of the sites. Recombination would therefore be 
expected to proceed with an opposite order of strand exchanges in these two cases 
(22). For Cre, a preference for cleaving at the "G/C" end of the loxP crossover 
(left side in Figure 1) has been reported (30), which indicates that formation of one 
of the two antiparallel synapses is slightly favored over the other. In principle, a 
bending preference arising from the crossover sequence itself could be responsible 
for the observed bias, although there are no published experimental data to sup- 
port this idea for the Cre-loxP system. Independent biochemical evidence based 
on studies of bulged DNA substrates in the Cre and Flp systems also supports this 
relationship between bending of the core recombination site and strand cleavage 
preference in the synaptic complex (38). 

The Flp recombinase, which has a larger crossover region compared to Cre, 
shows little or no preference for the order of strand exchange during recombination 
(38,39). Although bending of the /rr site by Flp has been well studied (42,43), 
there is no strong evidence that the bend occurs in a preferred direction, which is 
consistent with a lack of initial cleavage preference. The k-lni and Xer systems, 
on the other hand, show a clear preference for cleaving one strand first during 
recombination (5,31,46). 

THE HOLLIDAY INTERMEDIATE 

At present, there are two crystal structures available representing the Cre-HJ inter- 
mediate (17) and, more recently, a crystal structure of Flp recombinase bound to 
an HJ substrate with a 7-bp crossover site (11a). In one of the Cre-bound junctions 
(HJ2 in reference 16), eight overlapping DNA strands are assembled to form an 
HJ with four nicks (missing phosphates) that is fourfold symmetric with respect to 
sequence symmetry and in principle is free to branch migrate within the crossover 
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region. The branch point is located at the center of the crossover region in this 
structure. The other Cre-bound HJ (HJI in reference 17) is an immobile junction 
with the branch point fixed by design at the center of the crossover region. The 
structures of the two junctions are nearly superimposable, indicating that the pres- 
ence of the nicks and the asymmetries imposed to form an immobile junction most 
likely do not distort the observed structures significantly from that of the true HJ 
intermediate in Cxt-loxP site-specific recombination. 

The Cre-bound HJ arms, defined here as the duplex DNA segments extending 
away from the branch point, lie nearly in the same plane with a small amount of 
curvature within each arm that creates a slightly concave surface on the side of the 
junction plane where the recombinase catalytic domains bind. The arms adopt a 
distorted square planar arrangment that is strictly only twofold symmetric, with 
interarm angles of ~75° and ~105° (Figure 7). At the center of the junction all 
bases are Watson-Crick paired, and all eight bases are unstacked and exposed to 
solvent. The nearly square planar arms and the unstacking at the branch point 
in the Cre-HJ complexes most closely resemble the square planar form of the HJ 
characterized in solution by Lilley and coworkers (14) and observed in the crystal 
structures with bound RuvA protein (25,52). This free junction conformer exists 
in the absence of divalent ions or high concentrations of monovalent ions, where 
phosphate-phosphate repulsions presumably lead to a fully extended fourfold sym- 
metric structure (4 1 ), 

In addition to the unequal interarm angles, the geometry of the deoxyribose- 
phosphate backbone is quite different for the two pairs of DNA strands in the 
Cre-bound HJ. The strands that span the obtuse interarm angle form a more 
or less uninterrupted helicoidal trajectory through the branch point of the junc- 
tion. The DNA strands that span the acute interarm angles have a rather sharp 
discontinuity at the branch point, where the branch point phosphate rotates to- 
ward the center of the junction, away from the path traced by the flanking phos- 
phates (Figure 7b). Thus, in the Cre-bound HJ, like in the stacked- X form of 
the junction in solution (14), there are a pair of continuous DNA strands and 
a pair of crossing DNA strands that are stereochemically distinct. It follows, 
therefore, that the recombinase active sites that surround the scissile phosphates 
on the continuous versus crossing strands in the junction are not stereochemi- 
cally equivalent. Based on a comparison with the covalent intermediate complex 
(Figure 5), the crossing strand active sites are activated for cleavage and the 
continuous strand active sites are prevented from cleaving the DNA during 
recombination. 

It has been understood for many years that a change in the structure of the 
HJ intermediate in tyrosine recombinase recombination must serve the pivotal 
role of deciding whether the reaction should proceed forward to recombinant 
products or backward to restore the original substrates (see discussion in references 
13, 17,45). Early models of this isomerization step involved a migration of the 
junction branch point between the sites of cleavage within the crossover region 
(Figure 2). The strands cleaved in resolution of the HJ would be those whose 
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scissile phosphates were nearest the junction branch point in a given branch-point 
isomer. In addition to providing a mechanism for reading homology between 
sites, this model provided a rationale for how pairs of active sites could be turned 
on and off in response to the quaternary structure of the recombination complex. 
The Cre-DNA complex structures, and in particular the structures of the Cre-HJ 
intermediates, have provided a different view of this central isomerization step in 
the recombination pathway, which along with biochemical studies from several 
laboratories has led to an alternative model (17, 18). 

In the specific case of the Cre-bound HJ complex, there are two different ways 
one could draw the structure shown in Figure 7b. The conformers differ by which 
arms form acute and which arms form obtuse angles, and by which DNA strands 
adopt the crossing configuration with inverted phosphates at the branch point and 
which strands adopt the more continuous configuration. The Cre-loxP model for 
HJ isomerization shown in Figure 6 involves a planar scissoring motion of the 
HJ arms that swap the pairs of interarm angles along with adjustments to the 
deoxyribose-phosphate backbone torsion angles that lead to a swap of branch 
point phosphate configurations. The analogy with crossover isomerization of the 
Holliday intermediate in homologous recombination is evident. In both cases, the 
generation of strand equivalence by quaternary changes in the junction structure 
is accomplished. 

The effect of HJ isomerization on the catalytic domains of the recombinase 
subunits mirrors that of the DNA strands. Those pairs of adjacent subunits that 
are separated by a larger distance in one conformer are closer together in the 
alternative conformer. Accordingly, the nature of the interactions between the cat- 
alytic domains in these two distinct interfaces is exchanged between recombinase 
pairs. These differences in domain-domain interfaces may directly relate to the 
positioning of active site residues and therefore in determining which subunits are 
activated for cleavage. The situation with the N-terminal domains of the Cre sub- 
units is somewhat different. These domains maintain a nearly fourfold-symmetric 
arrangement in both isomers, although the interactions formed with the crossing 
and continuous strands of the junction are slightly different. 

In terms of active versus inactive subunits in the Cre-HJ model, the subunits 
whose active sites are positioned on the crossing strands are predicted to be active 
for strand exchange, and the pair of subunits on the continuous strands are pre- 
dicted to be rendered inactive. These predictions follow from a comparison with 
the cleaved Cre-DNA intermediate (23), where the active site that has catalyzed 
formation of a covalent Cre-DNA linkage corresponds to one containing the cross- 
ing strand phosphates in the HJ complex. A similarity also exists at the quaternary 
structural level between the synapsed substrates and the HJ intermediate. Aside 
from differences in DNA structure at the center of the synaptic complex due to the 
strand exchange event, the structures of the synapsed sites, the covalent Cre-DNA 
intermediate, and the Cre-HJ intermediate are nearly superimposable, allowing for 
easy prediction of the active subunits in the two complexes where direct evidence 
of cleavage is not available. 
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It is clear from Figure 6 that the proposed isomerization of the Cre-HJ inter- 
mediate does not involve migration of the branch point from the center of the 
crossover region. In fact, given the extent of the protein-protein interfaces in the 
Cre tetramer (average of ~3400 surface buried in each pair of adjacent sub- 
units), it is difficult to imagine how this HJ intermediate could undergo any branch 
migration. A coordinated rotation of the protein-bound DNA arms to move the 
branch point would require that the observed protein-protein interface be disrupted 
and a new interface be formed. Even if the old and new interfaces were isoener- 
getic, considerable activation energy would no doubt be required to accomplish 
this type of rearrangement. 



RECOMBINASE ACTIVE SITES 
AND PHOSPHORYL TRANSFER 

The active site model that has emerged from the integrase family protein and 
protein domain structures (27, 33, 62), the closely related eukaryotic topoisomerase 
lb structures (12,49), and an analysis of three intermediates in the Cre-DNA 
pathway is illustrated in Figure 8. The previously identified Arg-His-Arg side 
chains coordinate the scissile phosphate during recombination, while the conserved 
tyrosine side chain is positioned near the scissile phosphate, poised for nucleophilic 
attack. In addition to these four residues, two new participants in the enzyme active 




Figure 8 Active site cleavage model for Cre recombinase. The general base responsible for 
accepting a proton from the tyrosine nucleophile during cleavage has not been clearly identified 
biochemically but could be His-289. The general acid responsible for protonating the 5'-0H 
leaving group has been identified as the equivalent of Lys-201 in the vaccinia vims topoisomerase 
and is therefore likely to play the same role in the tyrosine recombinases, as shown here (32). The 
ligation reaction is simply the reverse of that shown above, beginning with nucleophilic attack of 
the 3'-phosphotyrosine intermediate by a 5'-hydroxyl group. 
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site have been identified. A tryptophan forms a hydrogen bond to the scissile 
phosphate via the indole nitrogen in Cre, whereas a histidine side chain occupies 
this position in the sequences of A-Int, HPl-Int, XerD, and the vaccinia and human 
topoisomerases. A comparison of tyrosine recombinase sequences indicates that 
this His/Trp hydrogen bond is likely to be conserved throughout the protein family. 

The second previously unidentified active site component is a lysine residue 
located in the loop between ^-strands 2 and 3 (Lys201 in Cre). Structural and 
biochemical studies in the vaccinia virus topoisomerase system were key to real- 
izing the importance of this residue in the site-specific recombinases (12,67). In 
the vaccinia virus enzyme, mutation of the corresponding residue (Lysl65) results 
in a loss of Iff* in catalytic efficiency (67). The alanine mutants of this lysine in 
XerD (10) and Cre (F Quo & G Van Duyne, unpublished observations) are each 
defective in recombination. In the covalent Cre-DNA and human topoiosomerase- 
Ib DNA structures, this lysine contacts a base adjacent to the cleaved phosphate in 
the minor groove (23, 49). In the synaptic Cre/DNA complexes and in the Cre/HJ 
complex structures, the loop containing Lys-201 is not well ordered, most likely 
indicating a high degree of mobility. 

A general model for acid-base catalysis of the initial phosphoryl transfer step in 
the tyrosine recombinase cleavage reaction is shown in Figure 8 (see also references 
19 and 58). The Arg-His-Arg triad of conserved side chains, along with the Trp 
side chain observed in Cre, provides an array of hydrogen bond donors surrounding 
the scissile phosphate that fulfills the requirement of stabilizing a pentacoordinate 
transition state of the phosphate and probably contributes an electrophilic catalysis 
component to the reaction. The equivalent of Lysine-201 has been shown to be the 
catalytic acid responsible for protonating the 5'-OH leaving group in the vaccinia 
virus topoisomerase system (32), and it seems quite likely that the same is true in 
the tyrosine recombinases. The identity of a general base that accepts the Tyr-324 
hydroxyl proton is still uncertain, although His-289 is a prime candidate. 

SUMMARY 

The convergence of biochemical, genetic, and structural studies in an area of bio- 
logical research generally results in a strengthened view of the underlying molec- 
ular processes. This is clearly the case for the tyrosine recombinase family of 
site-specific recombination enzymes. Structural data for five of the integrase family 
recombinase systems and two eukaryotic topoisomerase lb systems have provided 
a three-dimensional framework for understanding years of results obtained by 
a number of different laboratories. Biochemical studies in the vaccinia virus topoi- 
somerase lb enzyme have also contributed greatly to our understanding of catalysis 
in this superfamily. 

Our current understanding of the tyrosine recombinase site-specific recom- 
bination pathway in the three-dimensional sense has been guided primarily by 
structures of reaction intermediates in the Cre-loxP system. The most surprising 
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feature of what we have learned from this system is that the reaction appears to 
proceed with a single protein-DNA architecture that requires no large changes in 
quaternary structure in converting substrates to products. A subtle isomerization 
of the nearly coplanar HoUiday junction intermediate between dyad-symmetric 
forms may be sufficient to act as the active site switch that triggers the exchange 
of only one of the two pairs of DNA strands to generate either substrates or prod- 
ucts. The current model for the Cre-loxP recombination pathway is outlined in 
Figure 9, This mechanism requires antiparallel synapsis of the sites in order to 
lead to productive strand exchange. The strand exchange process occurs by a 
strand-swapping mechanism based on that proposed in 1995 by Nunes-Diiby et 
al (45). Three bases are tested for homology with the recombining site and ex- 
changed to form the Holliday intermediate, and the remaining three bases in the 
crossover region are tested during the second strand exchange step. 

Perhaps the most compelling questions remaining at the structural and mech- 
anistic level for the tyrosine recombinases involve the detailed stereochemical 
interfaces between subunits that regulate the recombination process. While Cre 
and XerC/D have similar N-terminal domain structures and might be expected 
to form similar protein-protein contacts in the recombination assembly that could 
mediate an allosteric regulatory process, it is clear that these contacts have not been 
conserved in Flp recombinase (11a). A discussion of the stereochemical compo- 
nents of this regulatory system, given the available structural and biochemical data, 
is presented elsewhere (64). 

A closely related question for the future regards the isomerization of the HJ- 
intermediate in tyrosine recombinase recombination. While the isomerization 
appears to be quite subtle in the Cre system, requiring only small changes in 
quaternary structure and no branch migration of the junction, the same need not be 
true of Flp, Int, and XerCD, where the crossover sequence lengths are in most cases 
different. In the Flp recombinase 8-bp crossover system, both a limited branch 
migration model and a simple 4-bp swap model would seem possible (65). In the 
XerCD 8 bp crossover and A-Int 7 bp crossover systems, the formation of specific 
nucleoprotein architectures adjacent to the recombination sites leads to interesting 
questions regarding the role of branch migration in the recombination pathway. 
The extent to which these nucleoprotein assemblies might inhibit, promote, or 
simply tolerate a limited branch migration process during recombination (16) will 
be an important issue in understanding the more complex tyrosine recombination 
systems in the future. 

ACKNOWLEDGMENTS 

I gratefully acknowledge the valuable comments, discussions, and support re- 
ceived from members of the site-specific recombination community during my 
laboratory's work in the Crt-loxP system. I also thank Drs. Feng Guo and Michael 
Gopaul, who carried out the crystallographic studies in the Crc-loxP system de- 
scribed here, and the National Institutes of Health for financial support. 



Cxt-loxP RECOMBINATION 101 



Visit the Annual Reviews home page at www.AnnualReviews.org 



LITERATURE CITED 

L Abremski KE, Hoess RH. 1984. Bacte- 
riophage PI site-specific recombination. 
Purification and properties of the Cre re- 
combinase protein. J. Biol. Chem. 259: 
1509-14 

2. Abremski KE, Hoess RH. 1992. Evidence 
for a second conserved arginine residue 
in the integrase family of recombination 
proteins. Protein Eng. 5:87-91 

3. Abremski KE, Hoess RH, Sternberg N. 
1983. Studies on the properties of PI 
site-specific recombination: evidence for 
topologically unlinked products following 
recombination. Ce// 32:1301-1 1 

4. Arciszewska L, Grainge I, Sherratt D. 
1995. Effects of Holliday junction position 
on Xer-mediated recombination in vitro. 
EMBOJ. 14:2651-60 

5. Arciszewska LK, Sherratt Dj. 1995. Xer 
site-specific recombination in vitro. EMBO 
y. 14:2112-20 

6. Argos P, Landy A, Abremski K, Egan JB, 
Haggard LE, et al. 1 986. The integrase fam- 
ily of site-specific recombinases: regional 
similarities and global diversity. EMBO 
J. 5:433^0 

7. Blakely G, May G, McCulloch R, 
Arciszewska LK, Burke M, et al. 1993. 
Two related recombinases are required for 
site-specific recombination at dif and cer 
\nE.coli¥.\2. Cell 75:351-61 

8. Burgin AB, Nash HA. 1995. Suicide sub- 
strates reveal properties of the homology- 
dependent steps during integrative recom- 
bination of bacteriophage lambda. Curr. 
Biol. 5:1312-21 

9. Campbell A. 1962. Episomes. A^/v. Genet. 
11:101-45 

10. Cao YH, Hayes F. 1999. A newly identi- 
fied, essential catalytic residue in a critical 
secondary structure element in the inte- 
grase family of site-specific recombinases 
is conserved in a similar element in eucary- 



otic type IB topoisomerases. J. Mol. Biol. 
289:517-27 

1 1. Carson M, 1991. Ribbons 2.0. J. Appl. 
Crystallogr. 24:958-61 

1 la. Chen Y, Narendra U, lype LE, Cox MM, 
Rice PA. 2000. Crystal structure of a 
Flp recombinase-Holliday junction com- 
plex: assembly of an active oligomer by 
helix swapping. Mol. Cell. 6:885-97 

12. Cheng CH, Kussie P, Pavletich N, Shu- 
man S. 1998. Conservation of struc- 
ture and mechanism between eukaryotic 
topoisomerase I and site-specific recom- 
binases. Cell 92:841-50 

13. Craig NL. 1988. The mechanism of 
conservative site-specific recombination. 
Anmi. Rev. Genet. 22:77-105 

14. Duckett DR, Murchie AIH, Diekmann S, 
von Kitzing E, Kemper B, et al. 1 988. The 
structure of the Holliday junction, and its 
resolution. CW/ 55:79-89 

15. Esposito D, Scocca JJ. 1997. The inte- 
grase family of tyrosine recombinases: 
evolution of a conserved active site do- 
main. Nucleic Acids Res. 25:3605-14 

16. Franz B, Landy A. 1995. The Holliday 
junction intermediates of lambda integra- 
tive and excisive recombination respond 
differently to the bending proteins integ- 
ration host factor and excisionase. EMBO 
/ 14:397-406 

1 7. Gopaul DN, Guo F, Van Duyne GD. 1 998. 
Structure of the Holliday junction inter- 
mediate in Cre-loxP site- specific recom- 
bination. EMBOJ. 17:4175-87 

1 8. Gopaul DN, Van Duyne GD. 1 999. Struc- 
ture and mechanism in site-specific reco- 
mbination, Curr. Opin. Struct. Biol. 9: 14- 
20 

19. Grainge I, Jayaram M. 1999. The inte- 
grase family of recombinase: organiza- 
tion and function of the active site. MoL 
Microbiol. 33:449-56 



102 VAN DUYNE 



20. Grindley NDF, 1997. Site-specific recom- 
bination: synapsis and strand exchange re- 
vealed. Cum BioL 7:R608-12 

21. Guo F. 2000. Structural studies on 
the mechanism of Cre-loxP site- specific 
recombination. PhD thesis. Philadelphia: 
Univ. Penn. 273 pp. 

22. Guo F, Gopaul DN, Van Duyne GD. 1999. 
Asymmetric DNA bending in the Cre-loxP 
site-specific recombination synapse. Proc. 
Natl. Acad Sci. USA 96:7143-48 

23. Guo F, Gopaul DN, Van Duyne GD. 
1997. Structure of Cre recombinase com- 
plexed with DNA in a site-specific re- 
combination synapse. Nature 389:40- 
46 

24. Hallet B, Sherratt DJ. 1997. Transposition 
and site-specific recombination: adapting 
DNA cut-and-paste mechanisms to a va- 
riety of genetic rearrangements. FEMS 
Microbiol. Rev. 21:1 57-78 

25. Hargreaves D, Rice DW, Sedelnikova SE, 
Artymiuk PJ, Lloyd RG, et al. 1998. Crys- 
tal structure of E, coli RuvA with bound 
DNA Holliday junction at 6 A resolution. 
Nat. Struct. BioL 5:441-46 

26. Hayes F, Sherratt DJ. 1997. Recombi- 
nase binding specificity at the chromosome 
dimer resolution site dif of Escherichia 
coli. J. Mol. BioL 266:525-37 

27. Hickman AB, Waninger S, Scocca JJ, Dyda 
F. 1997. Molecular organization in site- 
specific recombination: the catalytic do- 
main of bacteriophage HPl integrase at 
2.7 A resolution. Cell 89:227-37 

28. Hoess RH, Abremski KE. 1985. iVlech- 
anism of strand cleavage and exchange 
in the Cre-lox site-specific recombination 
system. / MoL BioL 181:351-62 

29. Hoess RH, Abremski KE, Irwin S, Kendall 
M, Mack A. 1990. DNA specificity of the 
Cre recombinase resides in the 25 kDa car- 
boxy I domain of the protein. J. MoL BioL 
216:873-82 

30. Hoess RH, Wierzbicki A, Abremski KE. 
1990. Synapsis in the Cre-lox site-specific 
recombination system. In Structure and 



Methods, ed. RH Sarma, MH Sarma, pp. 
203-213. New York: Adenine 

31. Kitts PA, Nash HA. 1988. Bacteriophage 
lambda site-specific recombination pro- 
ceeds with a defined order of strand ex- 
changes, y. MoL BioL 204:95-107 

32. Krogh BO, Shuman S. 2000. Catalytic 
mechanism of DNA topoisomerase IB. 
MoL CW/ 5: 1035-41 

33. Kwon HJ, Tirumalai R, Landy A, Ellen- 
berger T. 1997. Flexibility in DNA recom- 
bination: structure of the lambda integrase 
catalytic core. Science 276: 1 26-3 1 

34. Landy A, 1989. Dynamic, structural, and 
regulatory aspects of lambda site-specific 
recombination. Annu, Rev. Biochem. 58: 
913^9 

35. Landy A. 1993. Mechanistic and structural 
complexity in the site-specific recombina- 
tion pathways of Int and FLP. Curr. Opin. 
Genet. Dev. 3:699-707 

36. Landy A. 1999. Coming or going it's an- 
other pretty picture for the lambda-Int 
family album. Proc, NatL Acad. ScL USA 
96:7122-24 

37. Lee J, Jayaram M. 1995. Role of partner 
homology in DNA recombination. J. BioL 
Chem. 270:4042-52 

38. Lee J, Tonozuka T, Jayaram M. 1997. 
Mechanism of active site exclusion in a 
site-specific recombinase: role of the DNA 
substrate in conferring half-of-the-sites 
activity. Genes Dev. 1 1:3061-71 

39. Lee J, Tribble G, Jayaram M. 2000. Reso- 
lution of tethered antiparallel and parallel 
holliday junctions by the Flp site-specific 
recombinase. / MoL BioL 296:403- 
19 

40. Li 1 ley DM J. 1997. Site-specific recombi- 
nation caught in the act. Chem. BioL 4:7 1 7- 
20 

41. Lilley DMJ, Clegg RM. 1993. The 
structure of the four-way junction in 
DNA. Annu. Rev. Biophys. BiomoL Struct. 
22:299-328 

42. Luetke KH, Sadowski PD. 1995. The 
role of DNA bending in Flp-mediated 



CrQ~loxP RECOMBINATION 103 



site- specific recombination. / MoL BioL 
251:493-506 

43. Luetke KH,Sadowski PD. 1998. Determi- 
nants of the position of a Flp-induced DNA 
bend. Nucleic Acids Res, 26: 140 1-7 

44. Nash HA. 1996. Site-specific recombina- 
tion: integration, excision, resolution, and 
inversion of defined DNA segments. In 
Escherichia coli and Salmonella typhimu- 
rium: Cellular and Molecular Biology, 
ed. FC Neidhardl, R Curtiss, JL Ingraham, 
EC Lin, KB Low, et al, pp. 2363-76. 
Washington, DC: ASM Press 

45. Nunes-Diiby SE, Azaro MA, Landy A. 
1995. Swapping DNA strands and sens- 
ing homology without branch migration in 
lambda site-specific recombination. Curr, 
BioL 5:139-48 

46. Nunes-Diiby SE, Matsumoto L, Landy A. 
1 987. Site-specific recombination interme- 
diates trapped with suicide substrates. Cell 
50:779-88 

47. Nunes-Duby SE, Tirumalai RS, Kwon HJ, 
Ellenberger T, Landy A. 1998. Similari- 
ties and differences among 105 members 
of the Int family of site-specific recombi- 
nases. Nucleic Acids Res. 26:391-406 

48. Pan H, Clary D, Sadowski PD. 1991. 
Identification of the DNA-binding domain 
of the FLP recombinase. J. BioL Chem. 
266:11347-54 

49. Redinbo MR, Stewart L, Kuhn P, Cham- 
poux JJ, Hoi WG. 1998. Crystal structures 
of human topoisomerase I in covalent and 
noncovalent complexes with DNA. Science 
279:1504-13 

50. Deleted in proof. 

51. Ringrose L, Lounnas V, Ehrlich L, Buch- 
holz F, Wade R, et al. 1998. Comparative 
kinetic analysis of FLP and Cre recom- 
binases: mathematical models for DNA 
binding and recombination. / MoL BioL 
284:363-84 

52. Roe SM, Barlow T, Brown T, Oram M, 
Keeley A, et al. 1998. Crystal structure 
of an octameric RuvA-Holliday junction 
complex. MoL Cell 2:361-72 



53. Sadowski PD. 1995. The Flp recombi- 
nase of the 2-microns plasmid of Saccha- 
romyces cerevisiae. Prog. Nucleic Acids 
Res, MoL BioL 51:53-91 

54. SauerB. 1993. Manipulation of transgenes 
by site-specific recombination: use of Cre 
recombinase. Methods EnzymoL 225:890- 
900 

55. Schwartz CJ, Sadowski PD. 1989. FLP re- 
combinase of the 2 microns circle plas- 
mid of Saccharomyces cerevisiae bends 
its DNA target. Isolation of FLP mutants 
defective in DNA bending. / MoL BioL 
205:647-58 

56. Schwartz CJ, Sadowski PD. 1 990. FLP pro- 
tein of 2 mu circle plasmid of yeast induces 
multiple bends in the FLP recognition tar- 
get site. J. MoL BioL 216:289-98 

57. Sherratt DJ, Arciszewska LK, Blakely G, 
Colloms S, Grant K, et al. 1995. Site- 
specific recombination and circular chro- 
mosome segregation. Philos. Trans. R. Soc. 
London Sen B 347:37^2 

58. Sherratt DJ, Wigley DB. 1998. Conserved 
themes but novel activities in recombinases 
and topoisomerases. Cell 93: 149-52 

59. Stark WM, Boocock MR, Sherratt DJ. 
1992. Catalysis by site-specific recombi- 
nases. Trends Genet, 8:432-39 

60. Stark WM, Sherratt DJ, Boocock MR. 
1989. Site-specific recombination by Tn3 
resolvase: topological changes in the for- 
ward and reverse reactions. Cell 58:779-90 

61. Sternberg N, Hamilton D, Austin S, 
Yarmolinsky M, Hoess RH. 1981. Site- 
specific recombination and its role in the 
life cycle of bacteriophage PI . Cold Spring 
Harbor Symp. Quant. BioL 1:297-309 

62. Subramanya HS, Arciszewska LK, Baker 
RA, Bird LE, Sherratt DJ, et al. 1 997. Crys- 
tal structure of the site-specific recombi- 
nase, XerD. £MJ?0 / 16:5178-87 

63. Tirumalai RS, Kwon HJ, Cardente EH, 
Ellenberger T, Landy A. 1998. Recogni- 
tion of core-type DNA sites by lambda in- 
tegrase. J. MoL BioL 279:513-27 

64. Van Duyne GD. 200 1 . A structural view of 



104 VAN DUYNE 



tyrosine recombinase site-specific recom- 
bination. In Mobile DMA II, ed. N Craig, 
R Craige, M Gellert, A Lambowitz. 
Washington DC: ASM Press 

65. Voziyanov Pathania S, Jayaram M. 
1999. A general model for site-specific 
recombination by the integrase family re- 
combinases. Nucleic Acids Res. 27:930-41 

66. Weisberg RA, Enquist LW, Foeller C, 
Landy A. 1983. A role for DNA homology 
in site-specific recombination: the isola- 
tion and characterization of a site affinity 
mutant of coll phage lambda. J. Mol. BioL 
170:319-42 



67. Wittschieben J, Shuman S. 1997. Mech- 
anism of DNA transesterification by vac- 
cinia topoisomerase: catalytic contribu- 
tions of essential residues Arg- 1 30, Gly- 
132, Tyr-136 and Lys-167. Nucleic Acids 
Res. 25:3001-8 

68. Yang W, Mizuuchi K. 1997. Site-specific 
recombination in plane view. Structure 
5:1401-6 

69. Zhu XD, Pan G, Luetke K, Sadowski 
PD. 1995. Homology requirements for lig- 
ation and strand exchange by the FLP 
recombinase. / Biol. Chem. 270: 1 1 646- 
53 



NH2 




Figure 3 Domain structure and DNA-binding of Cre recombinase. (a) Ribbon/cylinder 
drawing of one Cre subunit bound to a loxP half-site (representing one half of Figure I). 
HeHces are labeled as in (b) and active site residues are drawn as red sticks. The scissile 
phosphate is drawn as a red sphere, (b) Folding pattern of Cre recombinase. Helices A 
through E comprise the N-terminal domain, and the remainder forms the catalytic domain. 
A similar fold has been observed for the catalytic domains of X-integrase (33), HPl -integrase 
(27), and XerD recombinase (62). This fold forms a subset of the eukaryotic topoisomerase 
IB (12) and the yeast (Flp) recombinase (50) catalytic domains. The locations of conserved 
catalytic residues (Cre numbering) are indicated. Ribbon/cylinder illustrations in Figures 
3-7 were produced with the program RIBBONS (11). 



Figure 4 Structure of the synapsed substrates (or synapsed products) Cre-DNA complex 
(22). (a) View of the synapse from the amino-terminal domain side. Helices in the N- 
terminal domain of one subunit are labeled. Subunits colored green are active for substrate 
cleavage, (b) View of the synapse from the catalytic domain side, rotated 1 80 degrees about 
a horizontal axis with respect to (a). Helices in one of the active (green) subunits are labeled 
and the C-terminal helices-N that interact with adjacent subunits are circled. The helix swap 
is cyclic among the four subunits. 
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Figure 5 Structure of the Cre-DNA covalent intermediate (23). 5'-cytidine residues in 
nicked suicide substrates were cleaved and allowed to diffuse away in order to trap this 
intermediate (46). (a) View from the N-terminal domain side, as in Figure 4a. (b) The DNA 
alone in the cleaved intermediate structure, viewed from the catalytic domain side of the 
complex, as in Figure 4b. Missing 5'-cytidine residues and the direction of strand movement 
and nucleophilic attack during strand exchange are indicated. 



Figure 7 Structure of the Cre-HoUiday junction intermediate (17). (a) View of the complex 
from the N-terminal domain side, as in Figures 4a and 5a. (b) The DNA in the Cre-HJ 
intermediate structure viewed from the catalytic domain side as in Figures 4b and 5b. The 
formation of three base-pairs with the partner substrates as a result of strand exchange is 
indicated by arrows (compare with Figure 5b). Only the phosphodiester backbone of the 
exchanged bases needs to move in order to form the HJ intermediate. 
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Figure 9 Mechanism of Qxt-loxP site-specific recombination, based on the strand- 
swapping model of Nunes-Diiby et al (45) and on structural models of Cre recombi- 
nase/DNA complexes. Green subunits are active for cleavage in the top half of the pathway, 
and purple subunits are active for cleavage in the bottom half of the pathway. The DNA 
substrates lie nearly in the same plane and undergo only a subtle scissoring motion at the 
HJ isomerization step of the reaction that serves to switch the roles of the protein subunits 
and switch strands that are activated for exchange. The mechanism does not require branch 
migration of the HJ intermediate. 
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914 LANDY 

PERSPECTIVES AND SUMMARY 

On the 25th anniversary of Robin Holliday's model for single-strand ex- 
change intermediates in homologous recombination (1), it is especially fitting 
to review the most recent developments in the site-specific recombination 
system of Escherichia coli bacteriophage A. This article focuses on de- 
velopments since the most recent reviews of A recombination (2, 3), and has a 
different emphasis than a concurrent review (4) that explores the regulatory 
aspects of A recombination in more detail. The present state of the field has its 
roots in the wealth of elegant A genetics that rallied around Allan Campbell's 
model for viral integration (5). The purification of essential proteins by 
Howard Nash and his collaborators (6-8), and the definition of recombination 
target sites (9-1 1), allowed this system to be the first recombination pathway 
eligible for rigorous biochemical analysis. New results to be discussed in this 
review concern the dynamic, structural, and regulatory aspects of A 
recombination. They are summarized below. 

The dynamic aspects of A recombination (such as mechanisms of strand 
exchange) prominently feature the biochemical analysis of Holliday junction 
recombination intermediates. Artificial DNA constructs designed to model 
putative Holliday-junction recombination intermediates are efficiently and 
specifically resolved by purified A Integrase into normal recombinant prod- 
ucts (12). Holliday intermediates can also be trapped as the major product in a 
A recombination reaction using *'suicide recombination substrates" (13). The 
pair of reciprocal strand exchanges that first form and then resolve the 
Holliday junctions proceed in a strictly prescribed order that is the same for 
integrative and excisive recombination (13, 14). This order is determined by 
DNA sequences and protein binding sites distant from the region of strand 
exchange (13, 14). Formation and resolution of the Holliday junction pro- 
ceeds via a high-energy intermediate containing a covalent linkage between 
the DNA and Tyr-342 of Integrase protein (15). The requirement for a full 7 
bp of DNA homology between recombining partners (16, 17) appears not to 
be for synapsis but rather for the purpose of branch migration between the 
staggered strand exchange sites within the Holliday intermediate (13, 18) (B. 
DeMassy, R. A. Weisberg, personal communication). The synapsis of att 
sites is most likely mediated by protein-protein and protein- DNA interactions, 
and in integrative recombination this may involve a protein-decorated super- 
coiled "donor" (att9) pairing with a protein-free "recipient" {attB) (19). 

The structural aspects of A recombination 2u-e becoming interesting as 
models for other complex protein-nucleic acid systems, such as those found 
in regulation of transcription and DNA replication. The att site DNAs func- 
tion as part of higher-order protein-DNA complexes called "intasomes" (20, 
21). These are formed by a set of cooperative and competitive protein-protein 
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interactions involving four proteins and 15 binding sites. Formation of one of 
these intasomes (aaP) requires supercoiled DNA (22). The intasome structure 
is shaped largely by three sequence-specific "accessory" proteins (LHF, Xis, 
and FIS) that induce extremely sharp bends in DNA (23). The Integrase 
protein has the potential to tether distant sequences and form DNA loops by 
virtue of two autonomous DNA-binding domains with different recognition 
specificities (24). Integrase executes recombination (7) by a mechanism of 
strand cleavage and ligation that is shared with more than 15 related recom- 
binases (Int Family) (15, 25). 

The extreme directionality and regulation of A recombination are among its 
most striking features. Directionality is the consequence of two distinctly 
different pathways for integrative and excisive recombination. Each pathway 
uses a unique, but overlapping, set of the 15 protein binding sites that 
comprise citt site DNAs (26, 27). Cooperative and competitive interactions 
involving all four recombination proteins determine the direction of 
recombination. They also bestow upon the reactions sensitivity to host and 
viral physiology and to environmental conditions. Regulation of recombina- 
tion at the level of gene expression is augmented by a mechanism-based 
response in which host-encoded proteins are incorporated as integral elements 
of the recombination reaction. One of the host-encoded proteins (IHF) that is 
required for both integrative and excisive recombination has one binding site 
that must be occupied for integration and must be vacant for excision (28). 
The other host-encoded protein (FIS) dramatically stimulates excisive 
recombination when levels of the phage-encoded Xis protein are low (29). 
Intracellular levels of FIS, which is also an accessory protein for a different 
class of recombination reactions, drop by 70-fold as cells go from exponential 
to stationary phase (29). 

In addition to the recent developments outlined above, reference will also 
be made to FLP and Cre, two well -characterized members of the Int family. 
Although they are similar to Int in the basic mechanisms of strand exchange, 
they are quite different in terms of structural complexity, directionality, and 
regulation. 

BACKGROUND 

Conservative site-specific recombination is distinguished from homologous 
recombination and transposition by a high degree of specificity for both 
partners; the strand exchange mechanism involves the cleavage and rejoining 
of specific DNA sequences in the absence of DNA synthesis. Two major 
families comprise this class of reactions. The Resolvase-Invcrtase family is 
distinguished by the use of a conserved serine to form covalent intermediates 
with DNA, a constraint to intramolecular reactions, and an orientation prefer- 
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ence for recombination sites that can be either direct repeats (e.g. y8 and Tn-3 
Resolvase) or inverted repeats (e.g. the Hin, Gin, and Cin inversion systems). 

The other family of reactions (Int Family) (15, 25) is distinguished by the 
use of a conserved tyrosine to form covalent intermediates with DNA, the 
ability to execute both inter and intra molecular reactions, and the capacity to 
recombine sites with either direct or inverted orientation. Members of the Int 
family can be further subdivided into the minimal systems (e.g. FLP of yeast 
2fi circle and Cre of bacteriophage PI) and the complex systems, such as 
bacteriophage A and related viruses. 

The A recombination system consists of four recombination sites {att sites) 
and four proteins that carry out the integrative and excisive recombination 
reactions as shown in Figure 1 . The Integrase (Int) protein cuts and reseals 
DNA to carry out strand exchange. The other phage-encoded protein (Xis) is 
required only for excisive recombination. Of the two host-encoded proteins, 
IHF is required for both reactions, while FIS enhances only excisive 
recombination and only under special circumstances. Integrative recombina- 
tion between specific sites on the viral (att?) and bacterial {attB) chromo- 
somes generates prophage bounded by the recombinant products attL and 
attR. An excisive recombination between attL and auR regenerates attB and 
attP in the bacterial and excised viral chromosomes. Of the four att sites, attP 
(POP') is the most complex and attB (BOB') is the simplest (see Figures 1 
and 2 for the coordinates and protein binding sites). The prophage sites attL 
(BOP') and attR (POB') are "hybrids" of att? and attB (see Figure I). 

The reader is referred to other closely related, or overlapping reviews on 
regulation of A recombination (4), conservative site-specific recombination 
pathways (30, 31), Resolvases (32, 33), Invertases (34), enhancers (35), 
resolution of Holliday junctions (36, 37), FLP (38), topology (39-41), and to 
many related chapters in the books Transposition (42), Genetic Recombina- 
tion (43), and Mobile DNA (44), 

PROTEINS AND THEIR GENES 
Int 

Int is a basic protein of 40,330 Mr (45, 46) that has type I topoisomerase 
activity (6, 7, 15, 47, 48) and binds specifically to two different families of 
DNA sequences (49, 50). During recombination, it carries out the cutting and 
resealing of att site DNA via a covalent Int-DNA intermediate in the absense 
of any high-energy cofactors (8, 51, 52). Int can also transfer covalently 
bound DNA to the 5' OH of another DNA molecule in a sequence- 
independent reaction (S. E. Nunes-Diiby, A. Landy, unpublished results). Int 
is encoded in the A genome adjacent to the phage att site, reflecting the 
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Figure I Integrative and excisive recombination pathways. The protein binding sites for 
arm-type Int ( Q ), core-type Int ( IHF ( □ ), Xis ( ^ ), and FIS ( O ) 

are indicated by tilled symbols when that site is occupied by its cognate protein to make a 
competent recombination partner for integrative ( |J ) or excisive ( ) recombination. 

Proteins required for each reaction (Int, IHF, and Xis> are in bold, proteins that inhibit (Xis and 
IHF) or enhance (FIS) the indicated reactions are in italics. Of the two models for strand exchange 
([ ])» the sequential model (left) is correct. 
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common motif of recombinase genes that are very close to their sites of 
action. The specific arrangement of recombination genes often reflects their 
respective strategies for regulation of gene expression, which, in the case of A 
int, involves retroregulation (53-57), its own cll-dependent promoter (58), 
and transcription from the distant Pl promoter [reviewed in (4)], In phages 
P22 and P2, the int genes are oriented as in A, but in 080 and P4 they have the 
opposite orientation (59-61), The P2 int gene, which was once hypothesized 
to be split by the an site (62), has recently been shown to terminate its coding 
region just before att (61). 

Thus far only one classical int mutant (resulting in a Glu to Lys change at 
position 174) (63) has been well characterized biochemically. The mutation 
was isolated in two independent experiments: (a) by selection for A bacter- 
iophage that could undergo site- specific recombination in an £. coli host 
mutant for IHF (called int-h) (64); {b) by selection for an Int partially 
independent of Xis in excisive recombination (called xm) (C. Gritzmacher, L. 
W. Enquist, R. A. Weisberg, unpublished data). In the absence of IHF, 
where Inf^ activity is depressed approximately 500-fold, Int-h activity is only 
depressed approximately 10-fold (52). The fact that this reduced, but signifi- 
cant, Int-h activity is identical for supercoiled and relaxed substrates indicates 
that the normal supercoiling requirement for att? is determined largely by 
IHF. The two most plausible explanations for this phenotype are that Int-h/ 
Xin may have a higher affinity than Int"^ for core-type binding sites; or that it 
works differently on the synaptic intermediate, either to stabilize it or acceler- 
ate its conversion to recombinant product (65). A number of other int mutants 
that are interesting because they are recombination proficient (66) or domi- 
nant (63) have been studied but not thoroughly characterized. 

A comparative study of the primary structures of proteins from seven 
different bacteriophages pointed to an "Integrase family" of recombination 
proteins that now includes more than 15 members (15, 25). Within this family 
of proteins there is one region of approximately 40 amino acids near the 
carboxy-tenninus where at 25 positions more than 50% of the residues belong 
to the same amino acid exchange group (67). Particularly striking are three 
perfectly conserved residues within this region: His-308, Arg-311, and Tyr- 
342 (using the numbering of A Int). 

The biochemical significance of this region was determined using a family 
of suicide recombination substrates designed to accumulate transient 
recombination intermediates (13) (see below, synapsis and strand ex- 
change, and Figure 3). It was shown that Tyr-342 of Int forms a covalent 
bond with DNA at the sites of strand exchange. A mutant Int in which 
Tyr-342 is changed to phenylalanine is devoid of both topoisomerase and 
recombinase activity but still binds to both classes of Int DNA-binding sites 
with an affinity comparable to wild-type Int (15). The applicability of these 
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results to the Int family as a whole is supported by studies on FLP and Cre 
(68-70). In contrast, the Resolvase-Invertase family of recombinases forms a 
covalent linkage with DNA via a Ser(p) (71) located very close to the 
amino-terminus (35, 71-73) (see also background). 

The recombinases within the Int family are found in a rather wide range of 
biological functions, such as phage integration and excision, fimbrial phase 
variation, and plasmid maintenance in E. coli and yeast. Some members rely 
on accessory proteins while others are self-sufficient. Some members, like 
Cre and FLP, are recombinases with a single DNA recognition site, while 
others, like A Int, have two different DNA-binding domains. This intrafamily 
heterogeneity is reflected by the fact that only a single highly conserved 
40-amino-acid region is common to all members of the Int family. It has been 
proposed that the unifying feature of the Int family is the protein domain for 
the DNA cutting-ligating activity; the identification of Tyr-342 as the active 
nucleophile in Int lends strong support to this view (15). 

The fact that Int recognizes two classes of DNA-binding sites (49, 50) (see 
below and Figure 2), along with the biochemical differences in binding to 
each class (5 1 , 74), led to the proposal for a new kind of DNA-binding protein 
(50). The existence of two functionally distinct DNA recognition domains in 
the Integrase protein was demonstrated by the fact that core-type and arm-type 
DNA sequences do not compete with each other for binding in nuclease 
protection experiments (24). These results rule out the proposal (75) that all of 
the Int binding sites are recognized by Int on the basis of similar structural 
DNA features. They also indicate that an Int monomer is capable of binding to 
both classes of sites simultaneously. The "bivalency" of Int provides a simple 
structural explanation for the ability of some int mutants to negatively com- 
plement (63, 76-79). 

The identity and structural autonomy of the two DNA-binding domains was 
established by proteolytic cleavage of Int and footprinting analysis of the 
resulting two major peptides (24). A chymotryptic cleavage between Leu-64 
and Thr-65 generates a 32-kd carboxy-terminal fragment that binds core-type 
sites and a 7-kd amino-terminal fragment that binds arm-type sites. The 
amino-terminal fragment that binds exclusively to arm-type sites is not re- 
quired for catalytic function, while the carboxy-terminal peptide retains 
topoisomerase function and resolves synthetic att site Holliday junctions (24). 
This is consistent with earlier observations that Int can efficiently resolve 
Holliday stmctures, even in the absence of arm-type binding sites (12). 

Xis 

Xis is a small (M^ of 8630), basic, phage-encoded protein that is required for 
excision but not integration (45, 46, 80). Xis has no significant homologies 
with other proteins, including analogous excisionases from other lambdoid 
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phage, and nothing is known about its structure. While Xis is relatively 
thermostable in vitro, it is rapidly degraded intracellularly by an unknown 
protease, so that 80% of the activity decays within five minutes (80-92). Int, 
on the other hand, is quite stable intracellularly. 

In the presence of Int and IHF, Xis promotes efficient recombination 
between attL and attR, With greater than 100 mM salt, Xis is required for this 
reaction, while at low salt it is dispensable, although still stimulatory (83). 
Several lines of evidence indicate that the low-salt Xis-independent reaction 
proceeds by a different pathway than the Xis-stimulated reaction (84). A 
similar difference in pathways has been proposed for integrative recombina- 
tion at high and low salt with supercoiled versus relaxed att? (85). 

Xis has been found to confer enhanced thermostability on Int protein in 
vitro (80), but there is no evidence to indicate whether this reflects an 
interaction relevant to recombination. No catalytic activity has been associ- 
ated with Xis, and its function is exerted by sequence-specific cooperative 
binding to two adjacent sites in the P arm (26, 86). As discussed below, 
binding at these sites introduces a very sharp bend in the DNA and is also 
associated with cooperative interactions with DNA-bound Int and FIS (23, 
29, 87, 88). 

IHF 

Integration Host Factor (IHF), as the name implies, was discovered as a 
cellular function essential for A site-specific recombination, both integration 
and excision (89-92). IHF is a heterodimer composed of two subunits, with 
predicted molecular weights of 11,200 and 10,580 for the a and /3 subunits 
respectively (8, 93-95). Each of the subunits is very similar in sequence to the 
type II DNA-binding proteins, a family that includes the major histone-like 
proteins of E. coli (HU) and other bacteria [reviewed in (96)]. The only 
known (primary) function of IHF is its ability to bind (97-99) and to bend (23, 
I0Q-I03) DNA at specific sites. 

It may be valid to infer some basic features of IHF structure from a 
comparison with the HU protein from Bacillus stearothermophilus (HBs), 
whose crystal structure has been solved U> 3 A resolution (104). Both of the 
IHF subunits show significant homology to the subunit of the HBs homodimer 
(94), The HBs structure has two intertwined subunits that form a hydrophobic 
wedge-shape body and two long flexible arms extending from the wide 
portion of the wedge. It is proposed that the two flexible arms form a 
right-handed helix that could bind within the major or minor groove of 
double-stranded B DNA. 

Craig & Nash (97) have proposed that IHF binds in the minor groove of 
DNA based on the patterns of protection against modification by di- 
methylsulfate (DMS). H. A. Nash (personal conmiunication) has found addi- 
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tional support for this proposal in the patterns of protection against cleavage 
of the sugar-phosphate backbone by hydroxy 1 radicals. In these experiments, 
which were done on the three IHF sites of on DNA, as well as in nuclease 
protection experiments on other IHF sites (98, 99), IHF is found to protect 
three to four helical turns of DNA. Despite this relatively large protected 
region, preliminary stoichiometric data indicate that only a single IHF dlmer 
binds per site (H. A. Nash, personal communication). It is also interesting to 
note that while IHF has high sequence specificity, the closely related HU and 
HBs proteins have little or no sequence specificity. 

The genes for IHF have been mapped (89, 90), cloned (93-95, 105), and 
overexpressed (106). The a-encoding gene, himA, maps to minute 37, and 
the )3-cncoding gene, which is referred to as both hip and himD, maps to 
minute 20 on the E. coli chromosome. HiplhimD is part of a gene cluster with 
at least two promoters (E. L. Flamm, R. A. Weisberg, personal communica- 
tion). The himK gene is part of a more complex cluster whose transcription 
involves at least seven promoters and three terminators, with the regulation 
being influenced by growth rate and the SOS response. For a recent review of 
the regulation see (4), 

Some evidence suggests that levels of IHF may vary as a function of 
cellular physiology, but this point is not clear. A large increase in IHF levels 
is detected in stationary-phase cells, but this depends on the conditions used to 
extract the protein (27, 28). Attempts to resolve this question by in vivo 
chemical modification have not been definitive because all three of the IHF 
sites in attP that were monitored were fully occupied under the growth 
conditions tested (29). 

The lambdoid phages 080, P22, and A all have IHF-binding sites within 
their very different att site DNA sequences (98). 080, like A, is unable to 
lysogenize ^iimA ox hiplhimD mutants (107), while P22 has not been tested. 
Since the number of IHF-binding sites is different in each of the phage att 
sites, it will be interesting to compare the three recombination reactions at a 
mechanistic level. (Cre and FLP are two members of the Int family that do not 
require IHF.) Transposons are another class of recombination elements where 
a role forJHF has been demonstrated or implicated (99, 101, 108, 109). In 
some recombination systems an IHF requirement or HU requirement can be 
replaced with low efficiency by the other protein (108. 1 10). The roles of IHF 
extend well beyond recombination and are discussed thoroughly in a recent 
review (111). 

FIS 

A second host-encoded protein (in addition to IHF) involved in the A site- 
specific recombination reaction was first discovered by testing co// extracts 
for proteins that would bind to att site DNA in a gel mobility shift assay. Such 
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an activity was found and subsequently shown to be FIS (29), a protein that 
had been identified (112-114), and purified (115, 116), on the basis of its 
ability to stimulate three recombination inversion reactions from the Resol- 
vase-Invertase family. In the A system, FIS stimulates in vitro excision 
approximately 20-fold when Xis levels are limiting, however it cannot replace 
Xis (29). FIS has no effect when Xis is at saturating levels and it does not 
influence integrative recombination. The in vivo relevance of FIS has been 
established by demonstrating occupancy of its binding site in the P arm (29), 
as well as dramatic differences in excision efficiency for FIS^ and FIS" cells 
(R. Johnson, personal communication). When bound to its single binding site 
in the P arm, FIS induces a DNA bend of approximately 90° (23). 

FIS, like IHF, is a small, basic, heat-stable DNA-binding protein (115, 
1 16). Unlike IHF, which is a heterodimer, FIS is a homodimer in solution. 
The protomer, based on its gene sequence, has 98 amino acids and a calcu- 
lated pi of 9.5 (1 17, 118). There is no sequence homology between FIS and 
the type II DNA-binding proteins such as HU or IHF, despite their gross 
similarity in physical properties and somewhat analogous roles as bending and 
accessory proteins in several different recombination pathways. FIS has no 
cysteines or histidines so that a '*zinc-fingcr" mode of DNA recognition is 
unlikely (1 17). However, within the carboxy- terminal portion, at amino acids 
74—93, there is a sequence with a high probability of adopting the helix-turn- 
helix DNA binding motif (117, 118) that has been characterized for repressors 
and other proteins (119). 

The Hin system of Salmonella typhimurium, the Gin system of bacter- 
iophage Mu, and the Cin system of bacteriophage PI constitute a closely 
related family of FIS-dependent site-specific recombination systems. These 
systems specifically invert a segment of DNA and thereby mediate expression 
of two alternate forms of a flagellar antigen or phage tail fiber protein (34, 35, 
120), Their requirement for the bacterial protein FIS is mediated by a binding 
site on the DNA that has been called a "recombinational enhancer." The 
function of recombinational enhancers, which contain two or three FIS bind- 
ing sites, is relatively independent of position or orientation relative to the 
sites of recombination (112-114, 118, 121, 122). 

The fis gene has been mapped to approximately 72.5 min on the E. coli 
chromosome, between /a^E and aroE (1 17, 1 18). The fact that the initiating 
methionine is preceded by a poor Shine-Delgamo translation initiation site, 
coupled with the high number of rare codons (11%), may contribute to the 
low abundance of FIS in E. coli. A null mutation in fis leads to a lO^-lO'^-fold 
reduction of Hin and Gin inversion in vivo and a complete loss of detectable 
FIS activity when extracts are assayed for ability to stimulate inversion in 
vitro (117, 1 18). These null mutants also exhibit a 10^-10'*-fold reduction in A 
excisive recombination efficiency, suggesting that under these experimental 
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conditions, intracellular levels of Xis are limiting (R. Johnson, personal 
communication). Strains carrying these same null mutations show no reduc- 
tion in viability, even when this mutation is combined with a mutation in 
hip/himD (encoding one of the subunits of IHF). The inability to fmd major 
effects of fts mutations on cell physiology, or on the expression of other 
cellular genes, may reflect the short time these mutants have been available 
for study. 

One reason for predicting some interesting effects of FIS on cell physiolo- 
gy, and a possible role in global aspects of regulation, is that its intracellular 
concentration drops more than 70-fold as E. coli goes from exponential 
growth to stationary phase (29). It is not known what controls the degradation 
of FIS or whether gene expression is also involved. 



ATT SITE STRUCTURE AND PROTEIN BINDING SITES 

The most prominent feature of the four an site DNA sequences is a 15 bp 
"core sequence'' they have in common. Individual sequence elements of 
functional significance lie within, outside, and across the boundaries of this 
common core, which itself has no functional significance. Controlled resec- 
tion experiments established that an? extends from -150 to +85 from the 
center of the core, while atth extends from approximately -15 to +15 (see 
Figure 2) (10, 11, 123). The prophage sites generated by integrative 
recombination, attL and attK, are fully competent for excisive recombination, 
but they contain more than the minimal sequences necessary for excision (27) 
(see below, directionality and regulation). It is interesting to note that the 
attB sites of six different systems are located within the 3^ -terminal regions of 
various tRNA genes (60, 98) (W. Reiter, P. Palm, S. Yeats, personal 
communication). While this association with tRNA genes does not hold for 
the majority of att^ sites, it is significant and may offer some useful insights 
about evolution and/or mechanisms. 

The att? sequence is 73% A+T, with one 48 bp region being 90% A+T (9, 
10). As might be expected from this base composition, there are several runs 
of up to six consecutive adenines, a sequence feature known to generate 
intrinsic curvature within the DNA (124-126). The center of att? curvature 
has been mapped to -45 to -50 in the P arm (127). This same region is also 
hypersensitive to modification by bromoacetaldehyde, but only if the DNA is 
supercoiled (128). While the intrinsic curvature in this region of the P arm 
may enhance the efficiency of recombination, it is not absolutely required 
(127). Its function may be to position the att site to an external, easily 
accessible loop in plectonemically supercoiled DNA (129, 130). 

In vitro integrative recombination reactions traced the observed require- 
ment for negative supercoiling to the att? partner (131). Below 50 mM NaCl 
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the reaction does not require supercoiled attP (132); however, several lines of 
evidence suggest that this low-sah reaction with linear attP proceeds by a 
different recombination pathway (85). In contrast to integration, excision 
does not require supercoiling (133), although supercoiling of either attL or 
attR stimulates the reaction at limiting protein concentrations. Excision is also 
less sensitive than integration to salt concentration (132). 

Overlap Region 

The 7 bp between the staggered cuts responsible for strand exchange is 
referred to as the "overlap region" (3), since in recombinant molecules this 
region receives one DNA strand from each partner. The att site mutants of 
Shulman & Gottesman (134) that segregate to both recombinant products 
have been located within the overlap region by DNA sequencing (135). These 
mutants greatly impair recombination efficiency by deleting one bp from the 
overlap region. Insertions of one bp are not as detrimental (136), but a 
systematic test of spacing has not been carried out. Because this region 
receives one DNA strand from each partner during recombination, one would 
predict that it must have the same sequence in both partners. Indeed, many 
sequence changes can be accommodated within the overlap region as long as 
the same changes are introduced into both partners (17, 81). The role of 
DNA : DNA homology will be discussed below (synapsis and strand ex- 
change). 

An overlap region, defined both by the position of staggered cuts and by the 
requirement for DNA: DNA homology, is also found in the lox sites (Cre 
system) and FRT sites (FLP system), 6 bp in the former (137-141) and 8 bp in 
the latter (142-145). In both of these systems some specific sequences within 
the overlap region have been observed to reduce recombination despite the 
preservation of homology between the partners. The weaker effect of specific 
overlap region sequences in A att sites may be due to the dominating effects of 
the proteins bound to the P and arms. Similarly, in the absence of the 
asymmetry conferred by the P and P' arms of A, the orientation of lox and 
FRT recombination partners relative to one another is determined wholly by 
the overlap region sequence (139, 143, 145, 146). The Resolvase-Invertase 
systems also make use of staggered cuts during strand exchange, but the size 
of the overlap region is only 2 bp (71, 73, 112, 147, 148). 

Int Binding Sites 

There are seven Int-binding sites in attP, two of which define the outermost 
limits of attP (10, 49). Sequence inspection suggested that these sites could be 
grouped into two distinct families of sequences: "core-type" (also called 
"junction-type") sites that are adjacent to the points of strand exchange and 
"arm-type" sites that are distal to the region of strand exchange. This distinc- 
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lion was confirmed, and reliable consensus sequences were obtained for each 
family, by analyzing a number of fortuitous Int binding sites in non-a/r DNA 
(49, 50). As described above, each family is recognized by one of two 
autonomous DNA-binding domains of Int (24). 

CORE-TYPE SITES The core-type sites (C and C in attP and B and B* in 
attB) are found as inverted repeats at the two ends of the overlap region and 
include the phosphodiester bonds that are cutiby Int during strand exchange. 
The two base pairs flanking this phosphodiester bond are not specified in the 
consensus recognition sequence of seven bp (50). This lack of specificity may 
be a consequence of protein structural features in the catalytic region of Int. 
Int bound at the core-type sites protects DNA against attack by dimethyl 
sulfate in both the major and minor grooves along one face of the DNA helix; 
the inverted repeats face each other across the central major groove. The 
opposite face of the DNA helix appears to .be devoid of strong protein contacts 
(50). The appropriate configuration of core-type sites is the only requirement 
for the resolution of synthetic Holliday junctions by Int, or its carboxy- 
terminal domain (12) (C. Pargellis, B. Franz, and A. Landy, unpublished 
results). 

Although affinity constants for Int binding have not been measured, they 
are considerably lower for the core-type sites than for the arm-type sites, as 
estimated by their respective behavior in nuclease protection (19, 49) and gel 
mobility shift (23) experiments. The relative order of binding affinity for 
core-type sites isC'>C = B>B' (50). However, the differential in 
affinities does not appear to be essential, since different sites can be sub- 
stituted for one another with little or no effect on recombination (13, 26). The 
differences in affinities of the individual sites may be partially masked by the 
significant amount of cooperativity between Int proteins bound at two core- 
type sites (50). 

The amount of Int- induced DNA bending at the COC core-type sites has 
been estimated to be approximately 17° (23). However, this value must be 
viewed cautiously because of the relatively low affinity of Int for these sites 
(50). Additionally, measurements on the isolated core-type sites may not be 
relevant to the functional recombinogenic complexes (see below, coopera- 
tive interactions). 

Two core-type sites flanking the overlap region comprise at(B. This con- 
figuration is very similar to the minimal recombination sites of both partners 
in the Cre and FLP systems. The simplicity of auB explains the existence of 
secondary att sites in the E. coli chromosome. These sites are utilized for A 
integration (at greatly reduced efficiencies) when attB is deleted (149-151). 
Sequence comparisons of the secondary att sites indicates that they are 
degenerate facsimiles of the simple attB site. The efficiency of a particular 
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secondary att site is presumably determined by the extent to which it emulates 
the canonical spacing and sequence of the overlap region and core-type Int 
binding sites (3, 50). Similarly, fortuitous facsimiles of attE, or core-type 
sites, could comprise the targets for Int topoisomerase activity on supercoiled 
DNA not containing an att site. 

ARM-TYPE SITES The initial characterization of a distinct class of essential 
Int'binding sites well removed from the region of strand exchange was both 
intriguing and challenging in terms of recombinogenic structures and mech- 
anisms (49, 74). Evidence for how they function in site-specific recombina- 
tion is beginning to emerge. The five arm- type Int-binding sites have a 
consensus recognition sequence of seven contiguous base pairs (49). The 
approximate binding affinities for individual sites indicates a hierarchy of P' 1 
= PI > P2 (49, 87, 88). P'2 and P' 3 have not been examined in isolation and 
all three of the P' sites are subject to cooperative binding effects amongst 
themselves (L. Moitoso de Vargas, A. Landy, unpublished results). In addi- 
tion to these cooperative effects, sequences outside of the binding site can also 
influence the binding affinities. IHowever, the low affinity of Int for P2, 
which has the best match to the consensus sequence, is not due to context 
effects (87).] 

Classical genetic techniques have produced only one protein-binding 
mutant in the att sites. This mutation, hen, is in the P'3 site and changes a 
completely conserved C to T (152). It is especially interesting because it 
abolishes integrative recombination without affecting excisive recombination 

(152) . To examine the roles of other arm-type sites, the single base transition 
of the hen mutation has been introduced into each of the five arm-type sites 

(153) . Multi-site changes have also been introduced into a subset of the 
arm-type sites (27, 88). From these studies and resection experiments (10, 
11), it is clear that the PI and P'2, and P'3 sites are required for integrative 
recombination, while the P2 site is not. The apparent dispensability of P' 1 for 
integrative recombination is subject to the same reservation that applies to all 
point mutants without a phenotype in highly cooperative systems. 

In excisive recombination, none of the point mutations has as large an 
effect as seen with integrative recombination (153). The largest effect is 
observed with the P' 1 mutation, suggesting that this site is required for 
excision, with some of the defect being compensated by cooperative in- 
teractions with other sites. PI is clearly not required for excision, because 
when it is deleted the attR works as well or better than wild-type attR (27). 
While P2 is also dispensable under some circumstances, it does appear to be 
important for excision. In the absence of the P2 site, more Xis is required to 
obtain the same level of recombination as with a wild-type a/rR (27, 87). 
Additionally, the reaction between an attR-hen P2 and an attL-hen P*2 or 
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attL'hen P'3 is reduced relative to a wild-type attL (153). Both of these 
observations are reflective of cooperative interactions to be discussed below. 
See Figure 1 for a summary of the site requirements for integrative and 
excisive recombination. 

Xis Sites 

Xis protein protects a single region of approximately 40 bp in the P arm (86). 
Gel mobility shift assays have shown that highly cooperative binding between 
at least two Xis molecules takes place (26). Mutagenesis of the binding region 
showed that each half could bind Xis independently, although with lower 
affinity than the canonical site (26). Protection patterns against DMS mod- 
ification also provide evidence for two binding sites, denoted XI and X2, and 
further suggest that they are arranged as a direct repeat (86). Determining a 
consensus recognition sequence will require the analysis of fortuitous Xis 
recognition sites and/or a mutational study of the canonical sites. 

The Xis mutations isolated thus far include a single base deletion between 
the two sites and a series of resected att sites in which varying extents of the P 
arm have been replaced with heterologous DNA. In the first case, excisive 
recombination is abolished. In the second, it was shown that both XI and X2 
must be occupied; defects in XI are only overcome when sufficient Xis is 
present to bind to the mutated as well as the normal site (27). 

Xis binding to XI or X2 bends the DNA approximately 45*" and 95"*, 
respectively. When XI and X2 are both occupied, as is normally the case, the 
bending angle is greater than 140° (23). This very sharp bend is undoubtedly a 
key feature in the role of Xis during recombination. 

IHF Sites 

There are three IHF sites in attP: HI and H2 in the P arm and H' in the P' arm 
(97). IHF binding to multiple sites on the same DNA fragment is not 
cooperative (28, 88, 97, 101, 154). Derivation and refinement of a consensus 
recognition sequence has been aided by the analysis of IHF binding sites from 
a variety of phage, plasmid, and bacterial sources (97-100, 154-156). IHF 
protection patterns against modification by dimethyl sulfate further delimit 
those positions involved in binding (97, 100) (see Figure 2). 

As discussed above, and similar to Xis, the only known function of IHF is 
its ability to bend DNA (23, 100-103). The extent of IHF-induced bending is 
estimated to be greater than 140° at each of the three attP binding sites (23). 
The strong IHF-induced bend is consistent with the observations and analyses 
of DNA bending induced by CAP protein (157). The large effects (greater 
than 100- fold) on IHF binding affinity by sequence changes outside the 
consensus region (101) (L. Moitoso de Vargas, A. Landy, unpublished 
results) could be the result of changes in the bendability, and/or intrinsic 
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curvature, of the DNA. Similarly* variation in the amount of DNA protected 
by IHF at each of several different binding sites (97, 98) might reflect 
differences in the length of DNA involved in making the curve around IHF 
protein. The H2 and H' IHF binding sites, as well as some non-a« IHF sites 
(100), occur in regions where the DNA has a substantial intrinsic curvature in 
the absence of bound protein (23, 127), e.g. as determined by gel mobility 
shift permutation analyses (125). While intrinsic curvature may have the 
potential to favor IHF binding if it is in phase with the IHF-induced bend 
(157), it is not required. A number of IHF sites, such as HI , do not show any 
evidence of intrinsic curvature (23, 102). 

Despite the gross similarity of H I , H2, and H* in the extent of IHF induced 
bending, the three sites are not functionally equivalent in recombination. 
Whereas Hi is required for integrative but not excisive recombination, H2 
and H' are required for both reactions (26-28, 154). Additionally, it appears 
that the IHF requirement at H2 and H*, but not at HI, can be partially 
substituted by another protein, possibly HU (154). Progress toward answering 
some of the questions about IHF should come from the recent IHF-DNA 
co-crystals made with synthetic oligomer (K. Appelt, personal communica- 
tion). 

FIS Sites 

FIS binds to a single site, called F, that overlaps the X2 site in the P arm; the 
region protected against nuclease digestion is slightly larger than X2 (29). In 
contrast, the recombinational enhancers of the Hin and Gin family consist of 
two independent FIS binding sites separated by approximately 48 bp (113, 
121, 122). The FIS site in A an also seems to differ in sequence from the 
enhancer FIS sites (122). This lack of similarity suggests that the FIS recogni- 
tion determinants are not fully understood and may involve specific DNA 
conformations not readily apparent in a simple sequence alignment. 

Even though FIS can substitute for Xis binding at X2, the two proteins do 
not recognize the same features in the DNA: their respective DMS- 
modification protection patterns and the boundaries of the regions protected 
against nuclease digestion are different, and FIS does not bind to the XI site 
(29, 86). In order to separate the effects of FIS and Xis, the XI -X2 site was 
replaced with an XI -XI site in the P arm, thereby eliminating FIS binding 
while retaining the Xis binding (29, 127). This FIS" mutant made it possible 
to show that, in contrast to its stimulation of excision, FIS has no effect on 
integrative recombination. 

The effect of adding FIS to a nuclease protection or gel mobility shift assay 
is equivalent in many respects to simply adding more Xis. If Xis is limiting. 
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the addition of FIS leads to occupancy of the XI and F sites with the same 
cooperativity seen for the XI and X2 sites when Xis is increased in the 
absence of FIS. FIS binding to F is also similar to Xis binding to X2 in the 
extent of induced DNA bending, approximately 90" in both cases (23). 
Furthermore, the increase in the total apparent bend to more than 140° is 
similar for Xl-F occupancy and for X1-X2 occupancy. Finally, as discussed 
below, all of the cooperative interactions in which Xis participates with Int 
and IHF occur equally well when FIS is bound in combination with Xis. 

COOPERATIVE INTERACTIONS AND HIGHER-ORDER 
STRUCTURES 

The first evidence for the wrapping of DNA in a putative higher-order 
structure was the appearance of a 10-base-pair repeat in the nuclease protec- 
tion patterns of Int-DNA complexes (10, 49, 158). Such patterns of enhanced 
cutting suggest that one face of the helix is oriented toward the solvent, while 
the other face is less accessible to nuclease because it lies along a protein 
surface or forms the inside of a loop (159-162). Electron microscopy graphi- 
cally revealed large protein complexes at the phage att site (20, 163) that 
suggested the involvement of approximately 230 bp of attP DNA and 4-8 Int 
monomers (20). The term intasome was coined to describe these complexes 
that were considered to contain only Int and DNA (20). It is now used as a 
generic term for protein-arf site DNA complexes that also contain the other 
proteins involved in recombinogenic structures. While electron microscopy 
was not successful in revealing the importance of IHF in complex formation, 
it did show that Xis protein favored the formation of complexes involving 
attK and attF, and that many combinations of complexes could pair with one 
another (20, 21). 

The first evidence that the functional recombinogenic complex involves 
DNA wrapped in a nucleosome-like structure came from the topological 
studies of Pollock & Nash (132). Intramolecular recombination between two 
att sites (either ati? X ati^ or atth x attK) oriented in opposite directions on a 
circular supercoiled molecule inverts one segment of the circle with respect to 
the other and produces knotted products (164, 165). For integrative 
recombination, under conditions that minimized the knotting due to random 
interwrapping of the superhelical DNA, approximately one half of the recom- 
binant products are simple circles and the rest are knotted. For excisive 
recombination, all of the recombinants are simple circles. The excess knotting 
of the integrative recombination products is postulated to reflect the wrapping 
of DNA in the att? complex (132). Subsequent topological analyses of 
knotted and catenated recombinant products have confirmed this result and 
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indicate that the attP "intasome" is wrapped with a negative sign, which in the 
nucleosome cores of chromatin produces a left-handed solenoid (130, 166- 
168). 

Several lines of evidence suggest that the role of the supercoiling require- 
ment for integrative recombination is to facilitate formation of the attP 
intasome. Since only attP requires supercoiling, it is not likely that any step 
subsequent to synapsis, such as strand exchange, is responsible for the 
requirement (169). The singularity of an? is evidenced by the fact that 
supercoiling is not required for excisive recombination, aUhough it stimulates 
the reaction (170). Any degree of DNA wrapping in the attL or attR in- 
tasomes should also have a negative sign, since negative supercoiling of either 
one enhances the efficiency of recombination (133) (W. Bushman, J. Thomp- 
son, A. Landy, unpublished results). 

In a more direct biochemical analysis, it was shown that binding of Int to 
PI, and to a lesser extent to P'3, is more resistant to challenge by excess 
salmon sperm DNA when an? is negatively supercoiled (22). This effect 
requires the presence of IHF, and is seen only on attP (i.e. not when the arm 
binding sites are separated as in attL and attR). The good correlation between 
the sign and degree of supercoiling needed to promote these effects, and that 
needed to promote recombination, supports the notion that the primary role of 
supercoiling is to favor formation of the intasome (22). 

The importance of the spatial orientation of the PI and HI sites within the 
intasome is seen when the helical phase of these sites is altered with respect to 
the remainder of attP by making insertions or deletions in a nonessential 
intervening region (85). Under normal reaction conditions (where supercoil- 
ing is required), substrates with changes of integral multiples of the DNA 
helical repeat recombine, while substrates with nonintegral changes do not. 
With non supercoiled (linear) attPy in low salt conditions, both the helical 
phasing and the PI and HI sites are much less important (although the overall 
efficiency is lower than with supercoiled attP) (85). 

The specific interactions responsible for the intasome structures have been 
deduced by correlations of recombination with multiprotein nuclease protec- 
tion assays, competition binding studies, site-specific mutagenesis of protein- 
binding sites, and results from suicide recombination substrates. These in- 
teractions are summarized below and illustrated schematically in Figure 4. 

The pairwise cooperative interactions between Int molecules bound at the 
core-type sites was demonstrated by the fact that Int binds much better to 
core-type sites positioned as inverted repeats 7 bp apart (i.e. the canonical 
configuration) than to isolated core-type sites (50, 135). Similar cooperativity 
is also inferred from studies with artificial HoUiday junctions, but this con- 
figuration includes the potential interactions among Int protomers bound at 
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Figure 4 Schematic representation of some of the cooperative interactions observed with linear 
recombinogenic complexes. In some cases, cooperative interactions ( ^^^y ) and competitive 
interactions ( »))iiih«vv ) are dependent upon binding of another protein { m ) The outcome of a 
competition at high or low Int is indicated by a vacated ( >^ ) protein binding site. For more 
information about the individual protein binding sites see Figures 1 and 2. 
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each of the four core-type sites (B. Franz, A. Landy, unpublished results). 
Cooperative interactions among Int nnolecules bound at the core-type sites 
have been difficuh to study because Int has a low affinity for these sites. 

The most thoroughly characterized of the pairwise cooperative interactions 
involves Xis binding at the adjacent XI and X2 sites (26, 86). This rather 
strong cooperativity is of special interest for two reasons. First, it involves a 
head-to-tail arrangement of sites, which is not so common for cooperatively 
binding proteins. Second, the Xis-Xis cooperativity can be substituted by 
Xis-FIS cooperativity. Keeping in mind that Xis and FIS do not have similar 
amino acid sequences, and that they bind to different DNA sequences, it is all 
the more surprising that for every interaction tested, the Xl-F pair works as 
well as the XI-X2 pair (29). In the following discussion, all Xis interactions 
refer to Xis bound ^simultaneously at XI and X2, and are probably equally 
applicable to simultaneous binding of Xis at XI and FIS at F. 

Xis bound at XI -X2 (or Xl-F binding) cooperatively assists Int binding at 
P2. In nuclease protection experiments there is a 16-fold reduction in the 
amount of Int required to fill 1*2 if X1-X2 is also filled (26). This cooperativ- 
ity is also observed in recombinations involving a//R with a defective P2 or 
XI site (27, 87, 88). 

In addition to the cooperative interactions within the intasome, there are 
also several competitive interactions. These, however, are not simple pair- 
wise competitions, since any single protein can fill all of its binding sites, and 
any pairwise combination of proteins can fill all of their respective binding 
sites (26, 29, 49, 86, 88, 97). Int binding at P2 and IHF binding at the 
neighboring HI site are perfectly compatible in the absence of Xis. However, 
when X1-X2(F) is filled, then binding at HI is competitive with binding at P2 
(29, 88). Clearly, this competition is not due to simple occlusion of a binding 
site by neighboring proteins, given the pairwise compatibility of protein 
binding. Rather, the favored interpretation is a competition for adjoining 
space within a higher-order structure. 

In order to better understand the dynamics of the H1-P2 competition, it is 
necessary to also refer to a complex cooperative interaction involving the PI 
site (88). In linear molecules, Int binding to Pi in attR is strongly enhanced 
by the simultaneous occupancy of HI, H2, and X1-X2(F). However, in attP 
this cooperativity is not observed (the P' arm is involved in a different set of 
cooperative interactions that includes the core-type Int binding sites) (L. 
Moitoso de Vargas, A. Landy, unpublished). Thus, the binding affinity of Int 
for PI in a higher-order complex is determined by interactions occurring on 
DNA at least 150 bp distant (see Figure 4). Richet et al (22) have found 
IHF-dependent cooperative interactions involving the PI site in an? that 
require supercoiling. Thus, two major features that distinguish integrative and 
excisive recombination (the supercoiling requirement for att? and the Xis 
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requirement for attR) are also needed for the long-range interactions involving 
the PI site. 

The above discussion indicates that each partner in the HI vs P2 competi- 
tion is involved in a separate set of cooperative interactions. Therefore, the 
outcome of the competition is determined by the configuration of an site 
arms, i.e. attR vs attP, by the relative concentrations of each protein, and by 
the topological state of the DNA. As will be discussed (directionauty and 
regulation), the cooperative transition in binding of proteins from the PI, 
HI , X1/X2(F), H2 set to the P2, X1/X2(F), H2 set (see Figure 5) occurs over 
the same range of protein concentrations that is required to overcome IHF 
inhibition of excision and to favor efficient excisive recombination (28, 88). 

In contrast to the complexity of the P arm, the relative simplicity of the P' 
arm highlights one of the primary interactions responsible for intasome 
structure. In atth, IHF bound at H' enhances binding of Int to core-type sites 
as long as the P* arm-type sites are present. This same cooperative interaction 
is reflected in an (H*-P')-enhanced nicking of attL suicide recombination 
substrates. In attL, both the IHF-enhanced Int binding at core-type sites and 
recombination proficiency depend upon the correct helical phasing of the P* 
arm-type sites with respect to the core region (L. Moitoso de Vargas, S. H. 
Kim, A. Landy, unpublished). These cooperative interactions lead to the 
architectural element shown in Figure 6 and discussed below. 

The number and complexity of the intasome-forming interactions stands in 
sharp contrast to the compact "minimized" reactions of some other Int family 
members such as Cre and FLP. Since these are highly efficient site-specific 
recombinases, it is appealing to ascribe much of the additional complexity of 
the A system to the requirements for directionality and regulation, as dis- 
cussed below. 



high [INT] 



low [int] 



PI HI P2 X1X2 



att P 



att R 




Figure 5 Coordinate occupancy of protein binding sites in the P arm of attR and att? as a result 
of cooperative and long-range interactions. Filled symbols indicate occupancy of the site by its 
cognate protein (PI and P2 by Int, HI by IHF, X 1X2 by Xis). Double-headed arrows highlight 
the differences between attP and attR. 
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Figure 6 Basic design element of an intasome. IHF induces a sharp (>I40*') bend in DNA to 
bring the distant arm-type and core-type sites close enough to be bound by an Int protomer {cis 
interaction). The bivalent Int can also be involved in (trans) interactions between the arm-type 
and core-type sites on different partners. Xis and FIS are postulated to play a similar role to IHF, 

SYNAPSIS AND STRAND EXCHANGE 
Covalent Intermediates 

The sites of strand exchange, i.e. the locus of cutting and resealing individual 
DNA strands, was narrowed down to two adjacent phosphodiester bonds, in 
an elegant isotope transfer experiment by M. Mizuuchi and K. Mizuuchi 
(171). Resolution of the remaining ambiguity came from sequence com- 
parisons of many secondary an sites (3, 50) and from the demonstration by 
Craig & Nash (51) that Int makes a covalent link with the 3* phosphate at the 
site(s) of strand exchange (reviewed in Ref. 3). 

In order to carry the analysis of transient recombination intermediates 
further, two families of suicide recombination substrates proved useful (see 
Figure 3) (13, 15). These att sites, which contain a strategically placed nick 
within the overlap region, initiate recombination normally and become de- 
viant only after Int has acted on them. They are unable to complete or reverse 
the reaction. The particular intermediate trapped by such substrates depends 
upon where the nick is placed within the overlap region. One family of suicide 
substrates contains a medial nick in the overlap region (centered between the 
two sites of strand exchange) and leads to the formation of covalent Int-DNA 
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complexes in high yield. This work showed that Tyr-342 of Int forms a 
covalent link with DNA as an intermediate during strand cleavage and 
religation (15). The other family of suicide substrates is discussed below. 

The Order of Strand Exchange 

The existence of covalent Int-DNA intermediates does not distinguish be- 
tween the two possible mechanisms of strand exchange: a concerted mech- 
anism, in which both DNA strands in each att site are cleaved prior to strand 
exchange; and a sequential mechanism, in which one DNA strand in each an 
site is cleaved and exchanged followed by a second cycle of cleavage and 
exchange (see Figure 1). 

From genetic crosses, it was found that Int-promoted recombination in vivo 
can generate the progeny predicted by a sequential strand exchange mech- 
anism (172, 173). However, the low yield of these progeny was cited as 
evidence against their being on the primary recombination pathway (128). In 
vitro data that were interpreted to favor a concerted mechanism included the 
observation of low levels of double strand breakage by Int protein (128, 174). 
Support for a sequential strand exchange mechanism came from the specific 
resolution of synthetic att site Holliday junctions by purified Int protein to 
yield completed recombination products (12). The high efficiency and 
specificity of the resolution reaction was persuasive but not compelling. 

Two different experimental devices finally led to the isolation of the elusive 
single-strand exchange recombination intermediates. One of these is the 
second family of suicide recombination substrates discussed above. In this 
family of suicide substrates, the preexisting att site nick is located at either the 
left (top strand) or right (bottom strand) Int cleavage site of the a/rB partner 
(see Figure 3), When the nick is in the bottom strand, single-strand exchange 
intermediates are produced under normal reaction conditions with almost the 
same efficiency as recombinant products from normal substrates. However, 
when the nick is in the top strand, no intermediates or recombinant products 
are formed (13). Thus, recombination proceeds via a sequential mechanism of 
strand exchange and the order is fixed such that the top strands must exchange 
first. Blocking the top strand exchange with a nick prevents the reaction from 
initiating. Using suicide recombination substrates for an attL x attK reaction, 
it was shown that excisive recombination has the same prescribed order of top 
strand exchange followed by bottom strand exchange (13, 14). 

Similar conclusions have been drawn for integrative recombination (exci- 
sion was not tested) from experiments in which substitution by phosphor- 
othioate nucleotides into att site DNA was used to block Int cleavage (14, 65). 
Substitution in the top strand blocked strand exchange, while substitution in 
the bottom strand generated Holliday intermediates. In these experiments, 
however, very low levels of the recombination intermediate were seen, even 
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when using the Int variant known as Int-h (see above) to increase their yield. 
This difference between the analogue substitution and suicide recombination 
substrates is infornnative, especially when coupled with the low levels of 
Holliday structure obtained in nomnal reactions, or those blocked by a heterol- 
ogy in the overlap region (see below) (13, 14, 18). These differences have led 
to the suggestion that the first strand exchange during integrative recombina- 
tion is reversible. Thus, in order to obtain high yields of recombination 
intermediate, it is not sufficient to block forward progression of the reaction, 
but it is also necessary to block its reversal (13). Understanding which of 
several possible mechanisms is responsible for the ability of the suicide 
substrate to trap Holliday junction intermediates is likely to provide useful 
insights into the normal reaction. Thus far, results obtained with a variety of 
different suicide recombination substrates indicate that Int protein, when 
provided with an appropriate 5 * OH acceptor, will efficiently execute a single 
DNA strand transfer reaction (13) (S. E. Nunes-Duby, A. Landy, un- 
published results). 

The strict order of strand exchanges does not arise from asymmetry in the 
core region, but rather from the asymmetric arrangement of proteins bound to 
the P and P* arm sequences: (a) the core regions in the suicide recombination 
substrates were constructed to be virtually symmetric (13); {b) the same order 
of strand exchanges is obtained with att sites having different core region 
sequences (13, 14); (c) inversion of the overlap (13) or the full core region 
(14) does not change the order of strand exchange relative to the P and P* 
arms. 

Holliday recombination intermediates have also been detected in the Cre 
and FLP systems (141, 145, 175). Thus, it is likely that this mechanism of 
sequential strand exchanges is common to members of the Int family. 

DNA Homology 

Although the specific sequence of the overlap region is not critical (within a 
wide latitude) for the order of strand exchange, or for the overall efficiency of 
recombination, it is required that the overlap region have the same sequence 
in both recombination partners (16, 17, 136). Two models have been pro- 
posed for the role of DNA : DNA homology in the overlap region. In the first, 
homology is required for synapsis of the recombining partners, and more 
specifically for the formation of a four-stranded DNA structure (7, 176, 177). 
From this view of recombination, considerable attention has been focused on 
whether the interstrand interactions are likely to be of the type proposed by 
McGavin (178) or by Wilson (179), and the likely direction in which strands 
rotate around the four-sided block during recombination (166, 177). In the 
second model, which is strongly favored by recent data, DNA: DNA homolo- 
gy is not required for synapsis but rather for some later step, such as branch 
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migration of a Holliday junction (16). Experiments with att sites that are 
heteroduplex within the overlap region argue against mechanisms involving a 
homology-dependent annealing of cohesive ends (180), however they do not 
distinguish between these two nnodels. 

A distinction between the two models is afforded by the experimental 
devices described above for obtaining single-strand exchange intermediates 
containing Holliday junctions. Whereas the formation of single-strand ex- 
change intermediates is completely blocked by a nonhomology on the left side 
of the overlap region, it is not affected by a nonhomology on the right (13, 
18). When the center of the overlap region is tested, single-strand exchange 
intermediate formation is blocked by a nonhomology at position 0 but not at 
position + 1 (18). These results rule out that class of models, including that of 
four-stranded DMA, in which homology over all (or most) of the overlap 
region is required for synapsis. However, they do not exclude a requirement 
for a local homology of several base pairs in the immediate vicinity of a strand 
exchange site. 

Whether or not strand exchange depends on some very short local 
homologies, a proposed requirement for a homology-dependent branch 
migration step is attractive (16). Artificial att site Holliday junctions have 
been made with sequence nonhomologies in the overlap region designed to 
restrict branch migration. For some of these constructions, resolution is 
strongly biased in favor of parental-type products, in contrast to wild-type 
Holliday structures that resolve to parental and recombinant-type products 
without pronounced bias. These results suggest that heterology prevents the 
Holliday junction from migrating from one Int cleavage site to the other and 
that resolution is executed by Int if the junction is at, or near, the Int cleavage 
site (B. DeMassy, L. Dorgai, R. A. Weisberg, personal communication). 
However, some data from these experiments suggest that Int may not be able 
to cleave when the junction is constrained very close to the cleavage site. The 
DNA branch migration model also gains further credibility from its ability to 
explain the results of recombination in which one partner is heteroduplex in 
the overlap region (17, 180). 

Synapsis 

Additional evidence that DNA homology is not required for synapsis comes 
from studies on both integrative and excisive recombination. Heteroduplex 
att% was incubated with a supercoiled homoduplex a//P under recombination 
conditions; although the two att sites cannot recombine, because of 
nonhomologies between their overiap regions, double-strand cleavage of af/B 
is promoted by the att? intasome. Thus, capture of a«B by the att? intasome 
does not depend upon DNA homology between the sites (19). In addition, 
chemical footprinting assays establish that under recombination conditions, 
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attB cannot stably bind Int in competition with other DNAs. Taken together, 
these experiments suggest that attE may obtain its Int by binding to a 
preformed attP intasome in a synapsis that is governed primarily by protein- 
protein and protein-DNA interactions and is independent of DNA homology 
(19). Thus the asymmetry in structural complexity is indeed reflected in 
functional asymmetry. For the analysis of excisive recombination, half-a/r 
sites were constructed by cutting at a restriction site within the overlap region. 
These half-att sites are proficient for synapsis and strand transfer to an intact 
att site partner despite the lack of homology between their overlap regions (S. 
E. Nunes-Duby, A. Landy, unpublished results). 

A particularly challenging aspect of synapsis is how two recombining att 
sites initially come together and are productively oriented with respect to one 
another. The fact that recombination can occur intermolecularly, in- 
tramolecularly, and between directly repeated or inverted sites suggests that 
att sites come together by random collision rather than by some form of 
tracking along the DNA. This general view is substantiated by topological 
analyses of the knots and catenanes generated by recombination: the topologi- 
cal complexity of recombinant products increases with both the density of 
supercoils and the distance between the att sites (130, 132, 164, 166, 181), 

In contrast to the Int family, the Resolvase-lnvertase family has more 
stringent orientation requirements and yields topologically invariant 
recombination products (33, 34, 39, 40). These features are consistent with a 
class of models in which the two recombination sites are interwrapped in a 
very specific manner, a requirement that is equivalent to a "topological filter*' 
(182-185). From the topological experiments, it is clear that A att sites do not 
have such an interwrapping requirement for productive synapsis. [Although 
an interesting exception may be the Xis-independent attL x attR recombina- 
tion in low salt (84). J The usefulness of topological analyses of recombinant 
products lies mainly in its ability to set limits on acceptable models (40, 130, 
166) when the number of biochemical unknowns is small. However, any 
acceptable mechanism for synapsis and strand exchange must be validated by 
its ability to explain the topological consequences of recombination. 

Several features of the A system are quite similar to those found in FLP and 
Cre, while others appear to be influenced by the unique arms of A att sites. 
Both FLP (68, 186-188) and Cre (189-191) share with A all those properties 
that suggest a random collision -type of synapsis. Cre, however, appears to 
exhibit an additional constraint in synapsis, suggesting some interwrapping of 
lox sites (189, 191 , 192). However, it is unlikely that this aspect of synapsis 
reflects a fundamental difference in mechanism, since relatively small muta- 
tional changes in either Cre or lox can make the Cre system topologically 
similar to FLP and Int (192). 
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DIRECTIONALITY AND REGULATION 

One of the unique and intriguing aspects of the site-specific recombination of 
A (and related phages) is its extreme directionality. Although the pathway is 
commonly diagrammed as a reversible integration/excision reaction, and the 
normal products of one reaction are the normal substrates of the other, such 
notation belies the fact that there are in fact two distinct pathways. Integrative 
recombination normally depends upon supercoiling for assembly of the attP 
intasonie (22, 169), while the attL and attR intasomes do not have this 
requirement (although they are stimulated by it) (133). Integrative recombina- 
tion requires higher levels of Int than excisive recombination, both in vivo 
(193) and in vitro (when attR is supercoiled) (27, S. E. Nunes-Diiby, A. 
Landy, unpublished results). Xis protein is required for excisive recombina- 
tion under physiological conditions, while it is inhibitory for integrative 
recombination (80, 83, 194). The HI site must be occupied by IHF for 
integrative recombination and it must be vacant for excision (27, 28, 88, 
154). Of a total constellation of 13 protein binding sites, PI, HI, and P'3 are 
uniquely required for integrative recombination, and P2, XI, X2, F, and P* I 
are uniquely required for, or involved in, excisive recombination (27-29, 87, 
88, 153), Indeed, deletion of the PI and HI sites, which are required forattP 
function, makes an attR that can be more efficient than wild type (27). It is 
clear that the A att sites have evolved to efficiently execute one reaction or the 
other in a mutually exclusive manner. 

The regulation of phage integration and prophage excision consists of a 
sophisticated network of pathways involving multiple phage and host genes. 
A detailed description of how these pathways direct recombination, and some 
models for their regulation, have recently been presented (4). Several features 
of these models are summarized below. 

In contrast to the analysis of gene expression, regulation at the level of the 
recombination reaction itself has been difficult to study in vivo. The in vitro 
data, and resulting models pointing to mechanism-based regulation, have to 
be tested by in vivo experiments specifically designed to detect such effects. 
For example, the large effect of FIS on in vivo excision, discussed above, was 
not detected until specifically looked for. The other mechanism-based regula- 
tion by a host-encoded protein involves IHF and the HI binding site. The 
effect of IHF inhibition of excision is to greatly sensitize the reaction of Int 
concentration via the cooperative and competitive interactions described 
above (28, 88). Under one set of in vitro conditions, IHF inhibition at 
HI sharpens the Int concentration dependence of excision by more than 
lO-fold (28). 

The two host-encoded proteins involved in A site-specific recombination 
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both exert regulatory effects on excision; FIS stimulates and IHF inhibits. It is 
significant that both regulatory responses are overridden by high levels of 
phage-encoded proteins: high Xis levels abolish FIS stimulation and high Int 
levels overcome IHF inhibition. These mechanisms endow the prophage 
excision reaction with a responsiveness to host physiology, viral physiology, 
and a range of environmental conditions (4, 28, 29). When there is a strong 
inducing stimulus, such as heavy UV irradiation, death of the host cell is 
imminent and it is advantageous for the prophage to excise and initiate a lytic 
cycle without delay or modulation. These are the conditions when levels of Int 
and Xis are expected to be high. When the inducing stimulus is weak and cell 
death is not certain, it is advantageous for viral excision and lytic develop- 
ment to be influenced by other factors, such as the likelihood of completing a 
lytic cycle. These are the conditions when Int and Xis levels may be low. 
Additionally, in the unfavorable conditions of stationary phase, FIS levels are 
low and excision is more difficult (requires higher levels of Xis). 

The ability of FIS to stimulate excision at low levels of Xis may also be 
relevant in the phenomenon of spontaneous phage production, which occurs 
at a frequency of one in 10^ to 10^ cell generations. Some "spontaneous" 
phage production probably results from DNA damage and a strong induction 
in a small fraction of cells. Additionally, some cells may produce phage as a 
consequence of random fluctuations in repressor concentration, leading to a 
weak induction. It has been suggested (4) that the possible role of FIS in 
assisting this low-frequency pathway is not unlike the FIS stimulation of 
recombination of other low-frequency recombinations (112, 113, 1 15, i 16). 
Both the hin inversion system of Salmonella flagellar antigens (195) and the 
gin inversion system of bacteriophage Mu tail fibers (196) also involve 
fluctuations in the synthesis of a poorly expressed recombinase. The stimula- 
tion of all three low-frequency reactions by FIS suggests that they will occur 
preferentially in exponentially growing cells, 

COMMENTS AND SPECULATIONS 
Cooperative Systems 

In retrospect we can see why it was so difficult to isolate att site defective 
mutants (despite the existence of many potential targets for disruption). In 
highly cooperative complexes like intasomes, it is likely that small changes in 
specific protein-binding sites will be masked by the remaining functional 
sites. Indeed, even with the ability to make site-directed mutations, the 
phenotypes can be misleading. A two-base change in the consensus recogni- 
tion sequence, which reduced IHF binding to the HI site by 500-fold, yielded 
little change in phenotype. Only after the introduction of five base changes 
could it be determined that the HI site is required for integrative recombina- 
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tion (28). Thus, in highly cooperative systems, considerable caution must be 
exercised in interpreting null phenotypes. Another example of confusing or 
misleading phenotypes was the observation that, under certain recombination 
conditions, deletion of the P2 Int binding site produced much less of a defect 
than a multisite mutant that destroyed its function. In fact, the interplay of 
competitive and cooperative binding in the P arm explains this surprising 
result, and is substantiated by biochemical analysis of the two mutants (87). A 
final example of considerations relevant to complex cooperative systems 
concerns the conditions under which apparently independent binding sites are 
actually used. The most prominent instances of this are the P arm binding site 
interactions (88), and the recent finding that some (or all) of the core-type 
sites do not obtain Int from solution (19), but through delivery by arm-type 
sites (L. Moitoso de Vargas, S. H. Kim, A. Landy, unpublished results). 

We have also learned that optimization of an in vitro system can result in 
overlooking some of the regulatory and biochemical complexity that is impor- 
tant under nonoptimal conditions. Indeed, nonoptimal conditions are preva- 
lent in nature. The effect of FIS protein could not have been detected in an in 
vitro system optimized for Xis concentration. 

Dynamic Aspects 

In considering how two att sites might synapse, it is useful to summarize the 
features of Int binding in the region of strand exchange. Both of the 
phosphodiester bonds that are cut during exchange of each strand, and all of 
the contacts for Int binding, are on the same face of the DNA helix, straddling 
the central major groove of the overiap region (50). The recent demonstration 
that synapsis does not require DNA homology across the entire 7 bp overiap 
region (13, 18, 19) rules out models of four-su-anded DNA and eliminates the 
motivation for proposing extensive DNA-DNA interactions in the early steps 
of synapsis. This makes even more appealing the view that the Int proteins lie 
on the inside faces of a synaptic complex where they can interact with one 
another. One particularly attractive configuration is that of two helixes lying 
across one another (as opposed to a side-by-side alignment). This would serve 
to maximize the interaction between all of the bound Int proteins, as sug- 
gested by some preliminary data on the resolution of synthetic Holiiday 
junctions and the properties of certain suicide recombination substrates (B. 
Franz, S. H. Kim, S. E. Nunes-Duby, A. Landy, unpublished results). It 
would also minimize the repulsive forces between the two DNA helixes. This 
view of the synaptic complex, with an "interior" cluster of Int molecules in 
the region of strand exchange, is similar to that suggested for y8 Resolvase on 
the basis of recent mutant and crystallographic data. These indicate a 
tetrameric complex of Resolvase monomers with the active-site serines on the 
outer surface (T. Steitz, N. Grindley, personal communication). 
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Structural Aspects 

The sharp (- 140°) bends induced at the loci of binding by IHF, Xis, and FIS, 
along with the bivalent DNA-binding capacity of Int, are probably the major 
determinants of intasonie structure. Intrinsic DNA curvature, as a conse- 
quence of A tracts (124-126), is also likely to contribute to the overall 
structures. Within the intasome, these elements are likely to work in conjunc- 
tion to form DNA loops that are tethered by an Int, binding simultaneously to 
arm-type and core-type sites. The delivery of Int to the low-affinity core-type 
sites, by the higher-affinity arm-type sites, would be facilitated by the protein- 
induced DNA bending. Binding at core-type sites could also be assisted 
through secondary (protein-protein) interactions. For example, one Int bound 
to both arm- and core-type sites could bring another Int to the core region, 
either on the same att site or on the synapsed partner. This view is also 
consistent with the cooperativity of Int binding in the core regions of all four 
an sites (50) (B. Franz, S. H. Kim, S. Nunes-Diiby, L. Moitoso de Vargas, 
A. Landy, unpublished results). 

This model of promoting binding to the core-type sites by the formation of 
DNA loops that deliver Int bound at arm-type sites predicts specific pairwise 
interactions between each core-type site and an arm-type site, or between two 
core-type sites. These interactions could involve sites within the same partner 
{cis interactions), and/or sites on different partners {trans interactions) (Figure 
6). Similarly, the cooperative interactions between proteins bound at core- 
type sites could be cis and/or trons. Identifying these pairwise interactions is 
one of the challenges of understanding the intasome structures. A related 
challenge is characterization of the interactions between DNA and the site- 
specific DNA bending proteins IHF, Xis, FIS. 

Regulatory Aspects 

Bacteriophage A has long served as a divining rod for uncovering host 
functions. Yet another example may be the host-encoded IHF and FIS pro- 
teins. It is tempting to speculate that the mechanism-based regulation of 
recombination incorporates IHF and FIS as integral elements of the reaction, 
because these two proteins reflect global signals of E. coli physiology. This 
suggestion is appealing because of the striking variation in FIS levels as a 
function of growth phase (29) and the wide range of reactions in which IHF is 
involved (111), 

The role of FIS in stimulating excision has led to speculation about a new 
phage regulatory pathway that could maintain derepression even under weak 
inducing conditions (4, 29). After excision, new genes from the b region are 
expressed by N-dependent transcription through att?. One of these {ben) 
encodes a double-stranded DNA endonuclease (197, 198) that has been 
purified and found to cleave only supercoiled DNA (199). This endonuclease. 
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or one of the others encoded in the b region (22, 200, 201), could play a role 
in prolonging (inducing) an SOS response by cleaving the bacterial genome 
and generating new sites that activate RecA protein. The activated RecA 
would promote continuous cleavage of A, thus preventing repression of the 
newly excised phage. 

The model proposes a function for the enigmatic b region (202) and, more 
importantly, it suggests how FIS could enhance excision, even under con- 
ditions that might otherwise lead to repression of the excised phage. Accord- 
ing to this model, spontaneous phage production does not reflect the lower 
limits of prophage repression, but rather is a programmed strategy in main- 
taining phage populations. In the absence of a pathway devoted to maintain- 
ing derepression (under weak inducing conditions), the dramatic effects of 
FIS on prophage excision would be primarily relevant to normal induction 
conditions. 

CLOSING THE CIRCLE 

Analysis of the A recombination system has benefitted greatly from Robin 
Holliday*s provocative insight about possible intermediates in the exchange 
(or rearrangement) of genetic information. In return, studies of the A pathway 
have contributed significantly to understanding more about the formation, 
properties, and resolution of these genetic intermediates. The Holliday struc- 
tures were conceived in the context of homologous, not site-specific, recom- 
bination — but such extrapolations are commonplace. One example is the 
possible relevance of intasome structures to the complex organizations in- 
volved in DNA replication, transcription, and other recombination systems. 
Understanding how the accessory proteins IHF, Xis, and FIS bend and shape 
DNA and how the bivalent Int protein tethers two distant DNA sites is likely 
to be of interest well beyond the field of site-specific recombination. Another 
example is the possible window into global regulatory devices that is afforded 
by the discovery of IHF and FIS proteins and their role in mechanism-based 
regulation of recombination. 

The circle is always closing but it is never closed. 
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